Novel nucleic acid and polypeptide

ABSTRACT

A novel isolated nucleic acid which corresponds to a gene located on human chromosome 2p21-16.3 is provided, and a polypeptide encoded thereby, together with mouse and chicken orthologs. The encoded polypeptides share a PGECCPLP motif and include an insulin-like growth factor binding domain, cysteine-rich repeats, an RGD motif and transmembrane domain, and interact with members of the transforming growth factor superfamily. The nucleic acids of the invention, and polypeptides encoded thereby, may be useful in the diagnosis and treatment of diseases including eye defects, neurodegenerative diseases, renal and kidney disease, bone and tooth abnormalities, wounds and skin damage.

CROSS-REFERENCE TO RELATED APPLICATION

[0001] This application is a continuation of International Application No. PCT/AU00/01435, filed Nov. 24, 2000, which was published in English under PCT Article 21(2), the disclosure of which is incorporated by reference herein in its entirety, and which claims priority of Australian Application No. PQ4348, filed Nov. 26, 1999.

FIELD OF THE INVENTION

[0002] THIS INVENTION relates to a novel isolated nucleic acid, and more particularly to an isolated nucleic acid which corresponds to a gene located on human chromosome 2p21-16.3. The invention also relates to an encoded polypeptide which interacts with members of the transforming growth factor beta (TGFβ) superfamily. The nucleic acids of the invention, and polypeptides encoded thereby, may be useful in the diagnosis and/or treatment of diseases including eye defects, neurodegenerative diseases, renal and kidney disease, bone and tooth abnormalities, wounds and skin damage without limitation thereto.

BACKGROUND OF THE INVENTION

[0003] Vertebrate development is a complex process involving a plethora of genes and gene products whose interactions direct crucial events such as cell fate, pattern formation, organogenesis and, at least to some extent, the development of intelligence and behaviour. Although an overview of this area is beyond the scope of this discussion, it has become clear that by understanding the genetic basis of development, the genetic basis of disease is also more properly understood.

[0004] Much effort has been directed at identifying the genes which control developmental processes and underlie particular diseases. Such genes are useful as tools for diagnosing disease and determining whether an individual is predisposed to a particular disease. This information is also useful in genetic counselling of carriers and affected individuals with respect to the possibility of transmitting the gene to offspring, and for the pre-natal detection of such genes carried by offspring.

[0005] Even more importantly, the genes (and their protein products) which underlie or contribute to particular diseases may provide new and efficacious treatments for those diseases.

[0006] There is still a great deal to be understood in terms of the genetic basis of vertebrate development and diseases arising from aberrant developmental processes.

SUMMARY OF THE INVENTION

[0007] The present invention is broadly directed to an isolated nucleic acid which corresponds to a gene located on human chromosome 2p21-16.3, or a chromosome structurally and functionally equivalent thereto, and a polypeptide encoded thereby.

[0008] In a first aspect, the invention provides an isolated polypeptide comprising the amino acid sequence PGECCPLP (SEQ ID NO: 1).

[0009] In preferred embodiments, the isolated polypeptide has an amino acid sequence as set forth in FIG. 1, hereinafter referred to as a human CRIM1 (SEQ ID NO: 2), mouse CRIM1 (SEQ ID NO: 3) and chicken CRIM1 (SEQ ID NO: 4) polypeptide respectively.

[0010] The invention also contemplates biologically-active fragments, variants and derivatives of CRIM1 polypeptides.

[0011] In a second aspect, the invention provides an isolated nucleic acid encoding the polypeptide of the first aspect.

[0012] In particular embodiments, the isolated nucleic acid has a sequence of nucleotides according to FIG. 2, hereinafter referred to as a human Crim1 nucleic acid (SEQ ID NO: 5), a mouse Crim1 nucleic acid (SEQ ID NO: 6) and a chicken Crim1 nucleic acid (SEQ ID NO: 7).

[0013] In another embodiment, the isolated nucleic acid is pcDNA3-hCRIM1myc deposited under accession number NM00/16530 at AGAL on Nov. 9, 2000.

[0014] In a further embodiment, the isolated nucleic acid has a nucleotide sequence according to FIG. 3 (SEQ ID NOS: 20-24), hereinafter referred to as a human “Crim1 genomic sequence”.

[0015] CRIM1 and S52 are used interchangeably herein, due to recent changes in nomenclature during isolation of the polypeptides and nucleic acids of the invention.

[0016] In a third aspect, the invention provides an expression construct comprising an isolated nucleic acid according to the second aspect.

[0017] In a fourth aspect, the invention provides a host cell comprising the expression construct of the third aspect.

[0018] In a fifth aspect, the invention provides an antibody which is capable of binding a CRIM1 polypeptide, biologically-active fragment, variant or derivative thereof.

[0019] In a preferred embodiment, the antibody is capable of binding a peptide having the amino acid sequence RVQVDSSQRMLRIAEPDARFSGFYSMQKQNHLQADNFYQTV (SEQ ID NO: 8) or the amino acid sequence KVCQPGYLNILVSKASGKPGEC (SEQ ID NO: 9).

[0020] In a sixth aspect, the invention provides a pharmaceutical composition comprising an isolated nucleic acid according to the second aspect or an isolated polypeptide according to the first aspect.

[0021] Preferably, the pharmaceutical composition comprises a pharmaceutically-acceptable carrier, diluent or excipient.

[0022] In one embodiment, the pharmaceutical composition is suitable for use in gene therapy.

[0023] In a seventh aspect, the invention provides a method of modulating the activity of a polypeptide of the TGFβ superfamily, said method including the step of administering to an animal a pharmaceutical composition according to the sixth aspect.

[0024] In an eighth aspect, the invention provides a method of detecting a predisposition to a genetically-inherited disease in an animal, said method including the step of identifying a CRIM1 nucleic acid mutation or polymorphism indicative of said animal being predisposed to, or suffering from, said genetically-heritable disease.

[0025] In a ninth aspect, the invention provides a CRIM1 mimetic.

[0026] In one embodiment, the mimetic is an antagonist of CRIM1.

[0027] In another embodiment, the mimetic is a mimic or agonist of CRIM1.

BRIEF DESCRIPTION OF THE FIGURES

[0028] FIG. 1

[0029] Alignment of amino acid sequences of human (h) CRIM1 polypeptide (SEQ ID NO: 2), mouse (m) CRIM1 polypeptide (SEQ ID NO: 3) and chicken (c) CRIM1 polypeptide (SEQ ID NO: 4). Conserved amino acids are indicated by an asterisk (*). Conservative changes are indicated by a period (.). The conserved PGECCPLP sequence (SEQ ID NO: 1) corresponds to residues 200-207 of human CRIM1.

[0030] FIG. 2

[0031] Alignment of nucleotide sequences of human (h) Crim1 cDNA (SEQ ID NO: 5), mouse (m) Crim1 cDNA (SEQ ID NO: 6) and chicken (c) Crim1 cDNA (SEQ ID NO: 7). Conserved nucleotides are indicated by an asterisk (*).

[0032] FIG. 3

[0033] Nucleotide sequence of a partial human Crim1 gene. The 236303 bp sequence includes exons 2-17 of the Crim1 gene. The exons are located as follows: exon 2: 33104-33277; exon 3: 77747-77989; exon 4: 79104-79224; exon 5: 101023-101144; exon 6: 113378-113560; exon 7: 115986-116183; exon 8: 135708-135836; exon 9: 146472-146628; exon 10: 148762-148883; exon 11: 150045-150254; exon 12: 153816-154031; exon 13: 158581-158802; exon 14: 173983-174177; exon 15: 181007-181129; exon 16: 183613-183800; and exon 17: 185153-187765. The Crim1 genomic sequence corresponds to SEQ ID NOS: 20-24.
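By way of non-limiting illustration, the exon coordinates listed above may be tabulated and the length of each exon computed as sketched below. The sketch assumes that the coordinates are 1-based and inclusive, so that each length equals end minus start plus one; this convention is not stated in the description, and the names EXONS, SEQUENCE_LENGTH and exon_lengths are hypothetical.

```python
# Exon coordinates within the 236303 bp partial human Crim1 genomic sequence
# (FIG. 3), copied from the figure description.
# Assumption: coordinates are 1-based and inclusive.
SEQUENCE_LENGTH = 236303

EXONS = {
    2: (33104, 33277),
    3: (77747, 77989),
    4: (79104, 79224),
    5: (101023, 101144),
    6: (113378, 113560),
    7: (115986, 116183),
    8: (135708, 135836),
    9: (146472, 146628),
    10: (148762, 148883),
    11: (150045, 150254),
    12: (153816, 154031),
    13: (158581, 158802),
    14: (173983, 174177),
    15: (181007, 181129),
    16: (183613, 183800),
    17: (185153, 187765),
}


def exon_lengths(exons):
    """Return {exon number: length in bp} for inclusive (start, end) pairs."""
    return {number: end - start + 1 for number, (start, end) in exons.items()}


if __name__ == "__main__":
    for number, length in exon_lengths(EXONS).items():
        print(f"exon {number}: {length} bp")
```

Under the stated coordinate convention, the same table may also be used to confirm that the exons are listed in ascending order and do not overlap.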

[0034] FIG. 4

[0035] Alignment of human and mouse CRIM1 polypeptide amino acid sequences together with a putative C. elegans ortholog.

[0036] FIG. 5

[0037] Structural analysis of human and mouse CRIM1 polypeptides.

[0038] A: Domain structure of the putative human and mouse CRIM1 ORFs compared with a C. elegans ortholog and Drosophila Sog and Xenopus chordin. The cleaved signal peptide (SP) is indicated. IGFBP=insulin-like growth factor binding domain; CR=cysteine-rich repeats; TM=transmembrane domain.

[0039] B: Alignment of cysteine-rich repeats of human CRIM1 (H-CR1-6), mouse CRIM1 (M-CR1-6) and a C. elegans ortholog (C-CR1-6).

[0040] FIG. 6

[0041] Northern analysis of human Crim1 and mouse Crim1 RNA expression.

[0042] A: Mouse embryonic Northern blot probed with mouse Crim1-derived probe. The major Crim1 isoform of 6.4 kb is present at all stages shown.

[0043] B: Multiple Tissue Northern analysis of human Crim1 mRNA expression in the indicated postnatal human tissues. The major 6.0 kb isoform is present in all samples except liver, whilst a minor 4.0 kb isoform is seen in placental RNA.

[0044] FIG. 7

[0045] Whole-mount in situ analysis with digoxigenin-labeled human Crim1 probe.

[0046] A: S52 expression in whole embryos from E9.5 to E13.5 and dissected kidneys from E15.5 to E16.5;

[0047] B: S52 expression in E16.5 kidney;

[0048] C: S52 expression in the floor plate and motor neurons of the spinal cord E12.5;

[0049] D: S52 in caudal somites E12.5.

[0050] FIG. 8

[0051] Whole embryo analysis of Crim1 RNA expression. Expression of Crim1 (A-E) compared with Sonic hedgehog (Shh; G-K). F=E13.5 embryo expression in eye (a), ear (b) and vibrissae (c). At later stages (E11.5 onwards), lack of probe penetration prevents visualization of staining of internal structures. N=notochord; FP=floor plate; So=somites; MN=motor neurons; LB=limb buds; Ey=eye. Scale bars: A, G: 80 μm; B, H: 100 μm; C, I: 120 μm; D, J: 145 μm; E, K: 160 μm; F: 300 μm; a, b, c: 40 μm.

[0052] FIG. 9

[0053] Analysis of Crim1 RNA expression in the embryonic mouse central nervous system. A-N=Expression in cervical sections across the developing spinal cord showing comparisons between S52 (A-E), sonic hedgehog (Shh; F-J) and Isl-1 (K-N) at E9.5 (A, F), E10.5 (B, G, K), E11.5 (C, H, L), E12.5 (D, I, M) and E13.5 (E, J, N). O-R=Expression in the developing brain showing comparison between S52 (O, Q) and Shh (P, R) at E10.5 (O, P=transverse section) and at E12.5 (Q, R=sagittal section). NC=notochord; FP=floor plate; MN=motor neurons; I1=dorsal interneuron subtype 1; I2=dorsal interneuron subtype 2; DRG=dorsal root ganglion; L=lens; D2=dorsal interneuron subtype (expresses Isl-1); DM=dorsal midline cells of hindbrain; RD=roof of diencephalon; VF=ventral forebrain. Scale bars: A, F: 20 μm; B, G, K: 45 μm; C, H, L: 70 μm; D, I, M: 80 μm; E, J, N: 90 μm; O, P: 80 μm; Q, R: 300 μm.

[0054] FIG. 10

[0055] Radiation hybrid panel mapping of CRIM1.

[0056] A: Primers 3′ PCR 3F and 3′ PCR 3R amplify human but not mouse cDNA. There is no intron in the genomic sequence as the band amplified from human genomic DNA is the same size as that amplified from human cDNA.

[0057] B: S52 is localized between the markers D2S1852 within band 2p21 and DS1409 within 2p16.3 on chromosome 2. pter=terminus of short arm p; qter=terminus of long arm q; Cent=centromere.

[0058] FIG. 11

[0059] Immunofluorescence of 5 day kidney explant culture with antibodies to Calbindin 28K (Sigma; used at 1:200) and polyclonal anti-S52 antibody (used at 1:150). Secondary antibodies were BODIPY-labeled anti-mouse IgG (1:200; green fluorescence) and Cy3-labeled goat anti-rabbit (1:500; red fluorescence). A-C: 10× magnification of Calbindin 28K (A); S52 (B); and DAPI (C). D-F: 40× magnification of Calbindin 28K (D), S52 (E) and DAPI (F). G, H: Merged images of Calbindin 28K and S52 at 10× (G) and 40× (H) magnification. I: Merged images of DAPI and S52 at 20× magnification.

[0060] FIG. 12

[0061] Subcellular localization of expressed recombinant CRIM1. (A) Schematic description of N- or C-terminally myc-tagged CRIM1 and C-terminally myc-tagged ectodomain (amino acids 1-901 of SEQ ID NO: 2). (B) Immunoblotting analysis of subcellular localization of N-terminal myc-tagged CRIM1. (C) Immunoblotting analysis of subcellular localization of C-terminal myc-tagged CRIM1 ectodomain. (D) Treatment of ectodomain with N-glycosylase. Immunofluorescence analysis of (E) N-terminal myc-tagged CRIM1 in permeabilized cells with anti-myc antibody; (F) N-terminal myc-tagged CRIM1 in permeabilized cells with anti-S52 C-terminal antibody; (G) N-terminal myc-tagged CRIM1 in permeabilized cells merged; (H) N-terminal myc-tagged CRIM1 in non-permeabilized cells with anti-myc antibody; (I) CRIM1 ectodomain in non-permeabilized cells with anti-myc antibody; and (J) CRIM1 in permeabilized cells with anti-myc antibody.

[0062] FIG. 13

[0063] Interaction between CRIM1 and TGF-β superfamily members. (A) Co-immunoprecipitation of CRIM1 and BMP4 preprotein using anti-myc mAb (left panel) and anti-S52 N-terminal antibody (right panel). (B) BMP secretion determined by anti-myc immunoblotting. (C) Secretion of CRIM1 ectodomain determined by anti-myc immunoblotting. (D) Ligand blotting to detect interaction between CRIM1 and TGF-β in aqueous humour. (E) BMP4 cell overlay assays.

[0064] FIG. 14

[0065] CRIM1 ectodomain bioassays. (A) Ectodomain purification detected by anti-myc immunoblotting. (B) In ovo electroporation bioassay in chick spinal cord. GFP indicates gene expression resulting from in ovo electroporation. Isl-1=Islet 1; En1=Engrailed 1; DRG=dorsal root ganglia.

DETAILED DESCRIPTION OF THE INVENTION

[0066] The present invention is predicated, at least in part, on the unexpected discovery of human, mouse and chicken Crim1 nucleic acids and CRIM1 polypeptides encoded thereby. Conservation of CRIM1/Crim1 and its expression during embryonic development suggests that CRIM1/Crim1 may be important for the normal development of vertebrates. In particular, it is proposed that CRIM1 may be involved in neuronal development and/or kidney and gonad development.

[0067] Furthermore, the present inventors provide evidence that CRIM1 polypeptides interact with TGF-β superfamily polypeptides, which polypeptides include TGF-β, a polypeptide known to be involved in eye defects such as cataract formation.

[0068] It is proposed herein that through binding to one or more members of the TGF-β superfamily of growth factors, CRIM1 will augment or antagonize their biological activities.

[0069] As used herein, unless the context requires otherwise, “comprise”, “comprises” or “comprising”, will be understood to imply the inclusion of a stated element or integer or group of elements or integers but not the exclusion of any other element or integer or group of elements or integers.

[0070] Throughout this specification, scientific terms are given their usual scientific meaning, although certain terms are defined herein to assist in their interpretation.

[0071] The term “recombinant” as used herein means artificially produced through human manipulation of genetic material, such as involving techniques generally falling within the scope of “recombinant DNA technology” as is well understood in the art.

[0072] By “isolated” is meant material that has been removed from its natural state or otherwise been subjected to human manipulation. Isolated material may be substantially or essentially free from components that normally accompany it in its natural state, or may be manipulated so as to be in an artificial state together with components that normally accompany it in its natural state. Isolated material may be in recombinant or native form.

[0073] CRIM1 Polypeptides

[0074] The invention provides CRIM1 polypeptides isolated from human, mouse and chicken, comprising the amino acid sequence PGECCPLP (SEQ ID NO: 1), for example as set forth in FIG. 1 (SEQ ID NOS: 2, 3 and 4 respectively).

[0075] CRIM1 polypeptides are further characterized by the presence of six cysteine-rich domains, an RGD domain, an IGFBP-like domain and a putative transmembrane domain.

[0076] By “polypeptide” is also meant “protein”, either term referring to an amino acid polymer which may include natural and/or non-natural amino acids as are well known in the art.

[0077] A “peptide” is a protein having no more than fifty (50) amino acids.

[0078] A peptide may be a “fragment” of a larger polypeptide, for example of at least 6, preferably at least 10 and more preferably at least 20 amino acids in length. Larger fragments comprising more than one peptide are also contemplated, and may be obtained through the application of standard recombinant nucleic acid techniques or synthesized using conventional liquid or solid phase synthesis techniques. For example, reference may be made to solution synthesis or solid phase synthesis as described, for example, in Chapter 9 entitled “Peptide Synthesis” by Atherton and Shephard which is included in a publication entitled “Synthetic Vaccines” edited by Nicholson and published by Blackwell Scientific Publications. Peptide synthesis is also described in detail in Chapter 18 of CURRENT PROTOCOLS IN PROTEIN SCIENCE, Coligan et al., Eds. (John Wiley & Sons, 1995-2000), which is incorporated herein by reference. Alternatively, peptides can be produced by digestion of a polypeptide of the invention with proteinases such as endoLys-C, endoArg-C, endoGlu-C and staphylococcus V8-protease. The digested fragments can be purified by, for example, high performance liquid chromatographic (HPLC) techniques.
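By way of non-limiting illustration, the fragments expected from proteinase digestion may be predicted computationally, as in the following sketch. It rests on a deliberately simplified assumption, namely that endoLys-C, endoArg-C and endoGlu-C each cleave on the C-terminal side of every lysine, arginine and glutamate residue respectively, with no allowance for neighbouring residues, buffer conditions or missed cleavages. The names CLEAVAGE_RESIDUES and digest are hypothetical, and the peptide of SEQ ID NO: 9 is used only as a convenient input.

```python
# Simplified cleavage rules (assumption): each enzyme cuts on the C-terminal
# side of every occurrence of the listed residue, with no exceptions.
CLEAVAGE_RESIDUES = {
    "endoLys-C": "K",
    "endoArg-C": "R",
    "endoGlu-C": "E",
}


def digest(sequence, enzyme):
    """Return the ordered list of fragments predicted for a complete digest."""
    sites = CLEAVAGE_RESIDUES[enzyme]
    fragments = []
    current = []
    for residue in sequence:
        current.append(residue)
        if residue in sites:
            # Cleave immediately after a recognised residue.
            fragments.append("".join(current))
            current = []
    if current:
        # Trailing residues after the last cleavage site.
        fragments.append("".join(current))
    return fragments


if __name__ == "__main__":
    peptide = "KVCQPGYLNILVSKASGKPGEC"  # SEQ ID NO: 9
    for enzyme in CLEAVAGE_RESIDUES:
        print(enzyme, digest(peptide, enzyme))
```

Predicted fragments of this kind would still require experimental confirmation, for example by the HPLC purification referred to above.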

[0079] The invention also contemplates “biologically-active fragments” of CRIM1 polypeptides.

[0080] Suitably, the biologically-active fragment has at least 1%, preferably at least 10%, more preferably at least 25% and even more preferably at least 50% of the biological activity of a CRIM1 polypeptide.

[0081] An example of a biologically-active fragment is a CRIM1 ectodomain polypeptide comprising amino acids 1-901 of SEQ ID NO: 2.

[0082] As used herein, “variant” polypeptides include CRIM1 polypeptides in which one or more amino acids have been replaced by different amino acids. It is well understood in the art that some amino acids may be changed to others with broadly similar properties without changing the nature of the activity of the polypeptide (conservative substitutions).

[0083] Substantial changes in function are made by selecting substitutions that are less conservative. Other replacements would be non-conservative substitutions and relatively fewer of these may be tolerated. Generally, the substitutions which are likely to produce the greatest changes in a polypeptide's properties are those in which (a) a hydrophilic residue (e.g., Ser or Thr) is substituted for, or by, a hydrophobic residue (e.g., Ala, Leu, Ile, Phe or Val); (b) a cysteine or proline is substituted for, or by, any other residue; (c) a residue having an electropositive side chain (e.g., Arg, His or Lys) is substituted for, or by, an electronegative residue (e.g., Glu or Asp); or (d) a residue having a bulky side chain (e.g., Phe or Trp) is substituted for, or by, one having a smaller side chain (e.g., Ala, Ser) or no side chain (e.g., Gly).
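By way of non-limiting illustration, categories (a) to (d) above may be expressed as a simple rule check, as in the following sketch. The residue groupings contain only the residues named as examples in the preceding paragraph and are therefore an assumption rather than a complete classification; the function name disruptive_categories is hypothetical.

```python
# Residue groups in one-letter code, limited to the examples named in the text.
HYDROPHILIC = set("ST")        # Ser, Thr
HYDROPHOBIC = set("ALIFV")     # Ala, Leu, Ile, Phe, Val
ELECTROPOSITIVE = set("RHK")   # Arg, His, Lys
ELECTRONEGATIVE = set("ED")    # Glu, Asp
BULKY = set("FW")              # Phe, Trp
SMALL_OR_NONE = set("ASG")     # Ala, Ser, Gly
CYS_OR_PRO = set("CP")         # Cys, Pro


def _between(a, b, group_1, group_2):
    """True if one residue is in group_1 and the other in group_2."""
    return (a in group_1 and b in group_2) or (a in group_2 and b in group_1)


def disruptive_categories(original, replacement):
    """Return which of categories (a)-(d) a single substitution falls under."""
    if original == replacement:
        return []
    hits = []
    if _between(original, replacement, HYDROPHILIC, HYDROPHOBIC):
        hits.append("a")
    if original in CYS_OR_PRO or replacement in CYS_OR_PRO:
        hits.append("b")
    if _between(original, replacement, ELECTROPOSITIVE, ELECTRONEGATIVE):
        hits.append("c")
    if _between(original, replacement, BULKY, SMALL_OR_NONE):
        hits.append("d")
    return hits


if __name__ == "__main__":
    for pair in [("S", "L"), ("C", "A"), ("K", "E"), ("W", "G"), ("L", "I")]:
        print(pair, disruptive_categories(*pair))
```

An empty result indicates only that none of the four listed categories applies; it does not establish that a substitution is conservative.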

[0084] The term “variant” also includes CRIM1 polypeptides produced from allelic variants of the sequences exemplified in this specification.

[0085] In another embodiment, variant polypeptides share at least 60%, preferably at least 80% and more preferably at least 90% sequence identity with any one of the CRIM1 amino acid sequences.

[0086] As used herein, “derivative” polypeptides are CRIM1 polypeptides which have been altered, for example by conjugation or complexing with other chemical moieties or by post-translational modification techniques as would be understood in the art. Such derivatives include amino acid deletions and/or additions to polypeptides of the invention.

[0087] Derivative polypeptides may include fusions of a CRIM1 polypeptide with another polypeptide or protein. Well known examples of such proteins include Protein A, glutathione S-transferase (GST), green fluorescent protein (GFP), maltose-binding protein (MBP), hexahistidine (HIS₆) and epitope tags such as FLAG, haemagglutinin and c-myc tags.

[0088] An example of a c-myc tagged CRIM1 polypeptide is provided hereinafter.

[0089] Other derivatives contemplated by the invention include, but are not limited to, modification to side chains, incorporation of unnatural amino acids and/or their derivatives during peptide, polypeptide or protein synthesis and the use of crosslinkers and other methods which impose conformational constraints on the polypeptides, fragments and variants of the invention. Examples of side chain modifications contemplated by the present invention include modifications of amino groups such as by acylation with acetic anhydride; acylation of amino groups with succinic anhydride and tetrahydrophthalic anhydride; amidination with methylacetimidate; carbamoylation of amino groups with cyanate; pyridoxylation of lysine with pyridoxal-5-phosphate followed by reduction with NaBH₄; reductive alkylation by reaction with an aldehyde followed by reduction with NaBH₄; and trinitrobenzylation of amino groups with 2,4,6-trinitrobenzene sulphonic acid (TNBS).

[0090] The carboxyl group may be modified by carbodiimide activation via O-acylisourea formation followed by subsequent derivatization, by way of example, to a corresponding amide.

[0091] The guanidine group of arginine residues may be modified by formation of heterocyclic condensation products with reagents such as 2,3-butanedione, phenylglyoxal and glyoxal.

[0092] Sulphydryl groups may be modified by methods such as performic acid oxidation to cysteic acid; formation of mercurial derivatives using 4-chloromercuriphenylsulphonic acid, 4-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, phenylmercury chloride, and other mercurials; formation of mixed disulphides with other thiol compounds; reaction with maleimide, maleic anhydride or other substituted maleimide; carboxymethylation with iodoacetic acid or iodoacetamide; and carbamoylation with cyanate at alkaline pH.

[0093] Tryptophan residues may be modified, for example, by alkylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide or sulphonyl halides or by oxidation with N-bromosuccinimide.

[0094] Tyrosine residues may be modified by nitration with tetranitromethane to form a 3-nitrotyrosine derivative.

[0095] The imidazole ring of a histidine residue may be modified by N-carbethoxylation with diethylpyrocarbonate or by alkylation with iodoacetic acid derivatives.

[0096] Examples of incorporating unnatural amino acids and derivatives during peptide synthesis include, but are not limited to, use of 4-amino butyric acid, 6-aminohexanoic acid, 4-amino-3-hydroxy-5-phenylpentanoic acid, 4-amino-3-hydroxy-6-methylheptanoic acid, t-butylglycine, norleucine, norvaline, phenylglycine, ornithine, sarcosine, 2-thienyl alanine and/or D-isomers of amino acids.

[0097] CRIM1 polypeptides of the invention, fragments, variants and derivatives, are readily made in recombinant form, as will be described in more detail hereinafter.

[0098] Generally, recombinant proteins may be conveniently prepared by a person skilled in the art using standard protocols as for example described in Sambrook, et al., MOLECULAR CLONING. A Laboratory Manual (Cold Spring Harbor Press, 1989), incorporated herein by reference, in particular Sections 16 and 17; CURRENT PROTOCOLS IN MOLECULAR BIOLOGY Eds. Ausubel et al., (John Wiley & Sons, Inc. 1995-1999), incorporated herein by reference, in particular Chapters 10 and 16; and CURRENT PROTOCOLS IN PROTEIN SCIENCE Eds. Coligan et al., (John Wiley & Sons, Inc. 1995-1999) which is incorporated by reference herein, in particular Chapters 1, 5, 6 and 7.

[0099] With regard to polypeptide variants, these can be created by mutagenizing a polypeptide or by mutagenizing an encoding nucleic acid, such as by random mutagenesis or site-directed mutagenesis. Examples of nucleic acid mutagenesis methods are provided in Chapter 9 of CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Ausubel et al., supra, which is incorporated herein by reference.

[0100] It will be appreciated by the skilled person that site-directed mutagenesis is best performed where knowledge of the amino acid residues that contribute to biological activity is available. In many cases, this information is not available, or can only be inferred by molecular modelling approximations, for example.

[0101] In such cases, random mutagenesis is contemplated. Random mutagenesis methods include chemical modification of proteins by hydroxylamine (Ruan et al., 1997, Gene 188 35), incorporation of dNTP analogs into nucleic acids (Zaccolo et al., 1996, J. Mol. Biol. 255 589) and PCR-based random mutagenesis such as described in Stemmer, 1994, Proc. Natl. Acad. Sci. USA 91 10747 or Shafikhani et al., 1997, Biotechniques 23 304, each of which references is incorporated herein. It is also noted that PCR-based random mutagenesis kits are commercially available, such as the Diversify™ kit (Clontech).

[0102] Antibodies

[0103] The invention also provides antibodies capable of binding CRIM1 polypeptides, biologically-active fragments, variants or derivatives thereof. Such antibodies may include any suitable antibodies which bind to or conjugate with a polypeptide of the invention, homolog or fragment thereof. Such antibodies may be polyclonal, obtained for example by immunizing an animal with a polypeptide, homolog or fragment thereof. It is for this purpose that peptides of the invention are particularly useful. Suitably, said animal could be a mouse, rat, rabbit or goat. Preferably, the animal is a rabbit.

[0104] Alternatively, monoclonal antibodies may be produced by a standard method such as described in CURRENT PROTOCOLS IN IMMUNOLOGY (1994, Eds. Coligan, Kruisbeek, Margulies, Shevach and Strober; John Wiley & Sons), which is hereby incorporated by reference. Such a method would involve obtaining antibody-producing cells, such as spleen cells, from an animal immunized as described above, and immortalizing said cells, such as by fusion with an immortalized fusion partner cell.

[0105] Preferably, the antibody is a polyclonal antibody.

[0106] Advantageously, the antibody is a rabbit polyclonal antibody.

[0107] In one embodiment, the antibody is raised against the following amino acid sequence: RVQVDSSQRMLRIAEPDARFSGFYSMQKQNHLQADNFYQTV (SEQ ID NO: 8), which sequence corresponds to the C-terminal 41 amino acids of human CRIM1.

[0108] In another embodiment, the antibody is raised against the following amino acid sequence: KVCQPGYLNILVSKASGKPGEC (SEQ ID NO: 9), which sequence corresponds to the N-terminal amino acids of mouse CRIM1.

[0109] As is well understood in the art, antibodies may be conjugated with labels selected from a group including a chromogen, a catalyst, an enzyme, a fluorophore, a chemiluminescent molecule and a radioisotope.

[0110] A large number of enzymes suitable for use as labels is disclosed in United States Patent Specifications U.S. Pat. No. 4,366,241, U.S. Pat. No. 4,843,000, and U.S. Pat. No. 4,849,338, each of which is herein incorporated by reference. Suitable enzyme labels useful in the present invention include alkaline phosphatase, horseradish peroxidase, luciferase, β-galactosidase, glucose oxidase, lysozyme, malate dehydrogenase and the like. The enzyme label may be used alone or in combination with a second enzyme in solution.

[0111] Fluorophores may be selected from a group including fluorescein isothiocyanate (FITC), tetramethylrhodamine isothiocyanate (TRITC), allophycocyanin (APC), Texas Red (TR), Cy5 or R-Phycoerythrin (RPE). Examples of useful fluorophores may be found, for example, in U.S. Pat. No. 4,520,110 and U.S. Pat. No. 4,542,104 which are herein incorporated by reference.

[0112] CRIM1 Mimetics

[0113] The invention contemplates mimetics which antagonize or mimic one or more biological activities of CRIM1 polypeptides, or homologs of CRIM1.

[0114] It will be appreciated that CRIM1 comprises an RGD domain, cysteine-rich domains, an IGFBP-like domain and a transmembrane domain.

[0115] Of these, the six cysteine-rich repeats are considered to be preferred targets for the screening or design of potential CRIM1 mimetics.

[0116] The term “mimetics” is used herein to refer to molecules that resemble particular functional regions of proteins or peptides, and includes within its scope the terms “agonist”, “partial agonist”, “analogue” and “antagonist” as are well understood in the art.

[0117] The aforementioned mimetics may themselves be peptides or polypeptides, or may be other organic molecules, preferably small organic molecules, with a desired biological activity and half-life.

[0118] Mimetics may be identified by way of screening libraries of molecules such as synthetic chemical libraries, including combinatorial libraries, by methods such as described in Nestler & Liu, 1998, Comb. Chem. High Throughput Screen. 1 113 and Kirkpatrick et al., 1999, Comb. Chem. High Throughput Screen 2 211.

[0119] It is also contemplated that libraries of naturally-occurring molecules may be screened by methodology such as reviewed in Kolb, 1998, Prog. Drug. Res. 51 185.

[0120] More rational approaches to designing mimetics may employ computer assisted screening of structural databases, computer-assisted modelling, or more traditional biophysical techniques which detect molecular binding interactions, as are well known in the art.

[0121] Computer-assisted structural database searching is becoming increasingly utilized as a procedure for identifying mimetics. Database searching methods which, in principle, may be suitable for identifying mimetics, may be found in International Publication WO 94/18232 (directed to producing HIV antigen mimetics), U.S. Pat. No. 5,752,019 and International Publication WO 97/41526 (directed to identifying EPO mimetics), each of which is incorporated herein by reference.

[0122] Generally, other methods include a variety of biophysical techniques which identify molecular interactions. Methods applicable to potentially useful techniques such as competitive radioligand binding assays, analytical ultracentrifugation, microcalorimetry, surface plasmon resonance and optical biosensor-based methods are provided in Chapter 20 of CURRENT PROTOCOLS IN PROTEIN SCIENCE Eds. Coligan et al., (John Wiley & Sons, 1997) which is incorporated herein by reference.

[0123] Crim1 Nucleic Acids

[0124] The invention provides isolated Crim1 nucleic acids, as for example set forth in FIG. 2 (SEQ ID NOS: 5-7).

[0125] The invention also provides the genomic sequence of FIG. 3, which sequence includes exons 2-17 of the Crim1 gene located on human chromosome 2p21-16.3 (SEQ ID NOS: 20-24).

[0126] The term “nucleic acid” as used herein designates single- or double-stranded mRNA, RNA, cRNA and DNA, said DNA inclusive of cDNA and genomic DNA.

[0127] A “polynucleotide” is a nucleic acid having eighty (80) or more contiguous nucleotides, while an “oligonucleotide” has up to eighty (80) contiguous nucleotides.

[0128] A “probe” may be a single or double-stranded oligonucleotide or polynucleotide, suitably labeled for the purpose of detecting complementary sequences in Northern or Southern blotting, for example.

[0129] A “primer” is usually a single-stranded oligonucleotide, preferably having 15-50 contiguous nucleotides, which is capable of annealing to a complementary nucleic acid “template” and being extended in a template-dependent fashion by the action of a DNA polymerase such as Taq polymerase, RNA-dependent DNA polymerase or Sequenase™.

[0130] The present invention also contemplates homologs of Crim1 nucleic acids of the invention.

[0131] In one embodiment, nucleic acid homologs encode polypeptide homologs of the invention, inclusive of variants, fragments and derivatives thereof.

[0132] In another embodiment, nucleic acid homologs share at least 60%, preferably at least 70%, more preferably at least 80%, or even more preferably at least 90% sequence identity with the nucleotide sequences of any one of SEQ ID NOS: 5-7 or SEQ ID NOS: 20-24.

[0133] As generally used herein, a “homolog” shares a definable nucleotide or amino acid sequence relationship with a nucleic acid or polypeptide of the invention as the case may be.

[0134] Included within the scope of homologs are “orthologs”, which are functionally-related polypeptides and their encoding nucleic acids, isolated from different organisms. It will be appreciated that the CRIM1 polypeptides and Crim1 nucleic acids isolated from human, mouse and chicken constitute a family of orthologs.

[0135] Terms used herein to describe sequence relationships between respective nucleic acids and polypeptides include “comparison window”, “sequence identity”, “percentage of sequence identity” and “substantial identity”. Because respective nucleic acids/polypeptides may each comprise (1) only one or more portions of a complete nucleic acid/polypeptide sequence that are shared by the nucleic acids/polypeptides, and (2) one or more portions which are divergent between the nucleic acids/polypeptides, sequence comparisons are typically performed by comparing sequences over a “comparison window” to identify and compare local regions of sequence similarity. A “comparison window” refers to a conceptual segment of typically at least 6 contiguous residues that is compared to a reference sequence. The comparison window may comprise additions or deletions (i.e., gaps) of about 20% or less as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the respective sequences. Optimal alignment of sequences for aligning a comparison window may be conducted by computerised implementations of algorithms (Geneworks program by Intelligenetics; GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 Science Drive Madison, Wis., USA, incorporated herein by reference) or by inspection and the best alignment (i.e., resulting in the highest percentage homology over the comparison window) generated by any of the various methods selected. Reference also may be made to the BLAST family of programs as for example disclosed by Altschul et al., 1997, Nucl. Acids Res. 25 3389, which is incorporated herein by reference.

[0136] A detailed discussion of sequence analysis can be found in Unit 19.3 of CURRENT PROTOCOLS IN MOLECULAR BIOLOGY Eds. Ausubel et al. (John Wiley & Sons Inc NY, 1995-1999).

[0137] The term “sequence identity” is used herein in its broadest sense to include the number of exact nucleotide or amino acid matches having regard to an appropriate alignment using a standard algorithm, having regard to the extent that sequences are identical over a window of comparison. Thus, a “percentage of sequence identity” is calculated by comparing two optimally aligned sequences over the window of comparison, determining the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, I) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity. For example, “sequence identity” may be understood to mean the “match percentage” calculated by the DNASIS computer program (Version 2.5 for Windows; available from Hitachi Software Engineering Co., Ltd., South San Francisco, Calif., USA).
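
By way of illustration only, the calculation described above may be sketched in Python as follows. This is a minimal sketch and not the DNASIS implementation: it assumes that the two sequences have already been optimally aligned over the comparison window, that gaps are written as “-”, and that gap positions never count as matches. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of sequence identity over an aligned comparison window.

    Both inputs are assumed to be pre-aligned, equal-length strings in which
    '-' marks a gap. A position counts as matched only when both sequences
    carry the same residue; gap positions never count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("comparison window is empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    # Divide by the window size (total aligned positions), scale to percent.
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Hypothetical 10-position window: 8 matches, 1 mismatch, 1 gap.
    print(percent_identity("ATGCCGTA-A", "ATGCCGTTGA"))
```

In this sketch the denominator is the full window size, following the definition given above; other conventions (for example, excluding gap positions from the denominator) would give different values.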

[0138] Homologs therefore include nucleic acids of the invention which have nucleotide substitutions, deletions or additions which do not substantially alter functional characteristics of polypeptides encoded thereby.

[0139] In this regard, a skilled addressee would realize that advantage can be taken of codon sequence redundancy to incorporate changes in a nucleotide sequence without affecting the encoded amino acid sequence of a polypeptide encoded thereby. Furthermore, nucleic acids may be altered so as to introduce “conservative” amino acid changes which, although altering an amino acid sequence, do not affect functional characteristics of polypeptides encoded thereby.
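
The use of codon redundancy may be illustrated by the following Python sketch, which assumes the standard genetic code and uses two hypothetical nucleotide sequences (not taken from any SEQ ID NO herein) that differ at several positions yet both encode the PGECCPLP motif. The names used are illustrative only.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence (reading frame 1) into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Hypothetical sequences differing only at redundant codon positions.
VARIANT_A = "CCTGGTGAATGTTGTCCTCTTCCT"
VARIANT_B = "CCCGGCGAGTGCTGCCCACTGCCG"

if __name__ == "__main__":
    print(translate(VARIANT_A))  # same peptide from both variants
    print(translate(VARIANT_B))
```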

[0140] Nucleic acid homologs of the invention may also comprise nucleic acids which hybridize with isolated Crim1 nucleic acids of the invention under at least low stringency conditions, preferably at least medium stringency conditions, or more preferably at least high stringency conditions.

[0141] “Hybridization” is used herein to denote the pairing of complementary bases of distinct nucleic acids to produce a DNA-DNA hybrid, a DNA-RNA hybrid, or an RNA-RNA hybrid according to base-pairing rules.

[0142] Typically, hybridizing nucleic acids are identified by blotting techniques that include a step whereby polynucleotides are immobilized on a matrix (preferably a synthetic membrane such as nitrocellulose), a hybridization step, a washing step and a detection step.

[0143] Southern blotting is used to identify a complementary DNA sequence; Northern blotting is used to identify a complementary RNA sequence. Dot blotting and slot blotting can be used to identify complementary DNA/DNA, DNA/RNA or RNA/RNA nucleic acids. Such techniques are well known by those skilled in the art, and have been described in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (Eds. Ausubel et al., John Wiley & Sons Inc 1995) at pages 2.9.1 through 2.9.20. According to such methods, Southern blotting involves separating DNA molecules according to size by gel electrophoresis, transferring the size-separated DNA to a synthetic membrane, and hybridizing the membrane bound DNA to a complementary nucleic acid labeled radioactively, enzymatically or fluorochromatically. In dot blotting and slot blotting, DNA samples are directly applied to a synthetic membrane prior to hybridization as above.

[0144] Similarly, a blotting step is used when identifying complementary nucleic acids in a cDNA or genomic DNA library, such as through the process of plaque or colony hybridization. A typical example of this procedure is described in Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL 2nd Ed (Cold Spring Harbor Press 1989) Chapters 8-12, which is herein incorporated by reference.

[0145] “Stringency” as used herein, refers to the temperature and ionic strength conditions, and presence or absence of certain organic solvents, during hybridization. The higher the stringency, the higher will be the degree of complementarity between the immobilized nucleic acids and the labeled nucleic acid.

[0146] “Stringent conditions” designates those conditions under which only nucleic acids having a high frequency of complementary bases will hybridize, and remain hybridized during washing.

[0147] Reference herein to low stringency conditions includes and encompasses:—

[0148] (i) from at least about 1% v/v to at least about 15% v/v formamide and from at least about 1 M to at least about 2 M salt for hybridisation at 42° C., and at least about 1 M to at least about 2 M salt for washing at 42° C.; and

[0149] (ii) 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHPO₄ (pH 7.2), 7% SDS for hybridization at 65° C., and (a) 2×SSC, 0.1% SDS; or (b) 0.5% BSA, 1 mM EDTA, 40 mM NaHPO₄ (pH 7.2), 5% SDS for washing at room temperature.

[0150] Medium stringency conditions include and encompass:—

[0151] (i) from at least about 16% v/v to at least about 30% v/v formamide and from at least about 0.5 M to at least about 0.9 M salt for hybridisation at 42° C., and at least about 0.5 M to at least about 0.9 M salt for washing at 42° C.; and

[0152] (ii) 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHPO₄ (pH 7.2), 7% SDS for hybridization at 65° C. and (a) 2×SSC, 0.1% SDS; or (b) 0.5% BSA, 1 mM EDTA, 40 mM NaHPO₄ (pH 7.2), 5% SDS for washing at 42° C.

[0153] High stringency conditions include and encompass:—

[0154] (i) from at least about 31% v/v to at least about 50% v/v formamide and from at least about 0.01 M to at least about 0.15 M salt for hybridisation at 42° C., and at least about 0.01 M to at least about 0.15 M salt for washing at 42° C.;

[0155] (ii) 1% BSA, 1 mM EDTA, 0.5 M NaHPO₄ (pH 7.2), 7% SDS for hybridization at 65° C., and (a) 0.1×SSC, 0.1% SDS; or (b) 0.5% BSA, 1 mM EDTA, 40 mM NaHPO₄ (pH 7.2), 1% SDS for washing at a temperature in excess of 65° C. for about one hour; and

[0156] (iii) 0.2×SSC, 0.1% SDS for washing at or above 68° C. for about 20 minutes.

[0157] In general, the Tm of a duplex DNA decreases by about 1° C. with every increase of 1% in the number of mismatched bases.
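
As a worked illustration of this rule of thumb, the following Python sketch estimates the melting temperature of a mismatched duplex from that of the perfectly matched duplex. The slope of about 1° C. per 1% mismatch is taken from the paragraph above; the function name and the example starting temperature of 80° C. are hypothetical, and the result is an approximation only.

```python
def estimated_tm_with_mismatches(
    tm_perfect_c: float,
    percent_mismatch: float,
    drop_per_percent_c: float = 1.0,
) -> float:
    """Approximate duplex melting temperature (deg C) allowing for mismatches.

    tm_perfect_c:       melting temperature of the perfectly matched duplex
    percent_mismatch:   percentage of mismatched bases (0-100)
    drop_per_percent_c: assumed decrease per 1% mismatch (about 1 deg C)
    """
    if not 0.0 <= percent_mismatch <= 100.0:
        raise ValueError("percent_mismatch must be between 0 and 100")
    return tm_perfect_c - drop_per_percent_c * percent_mismatch


if __name__ == "__main__":
    # Hypothetical duplex: Tm of 80 deg C when perfectly matched, 10% mismatch.
    print(estimated_tm_with_mismatches(80.0, 10.0))
```

On this approximation, a homolog sharing 90% identity with the probe would be expected to melt about 10° C. below the perfectly matched duplex, which is why wash temperature strongly affects which homologs remain hybridized.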

[0158] Notwithstanding the above, stringent conditions are well known in the art, such as described in Chapters 2.9 and 2.10 of Ausubel et al., supra, which are herein incorporated by reference. A skilled addressee will also recognize that various factors can be manipulated to optimize the specificity of the hybridization. Optimization of the stringency of the final washes can serve to ensure a high degree of hybridization.

[0159] In another embodiment, nucleic acid homologs may be prepared according to the following procedure:

[0160] (i) creating primers which, optionally, are degenerate wherein each comprises a respective portion of an isolated Crim1 nucleic acid; and

[0161] (ii) using said primers to amplify, via nucleic acid amplification techniques, one or more amplification products from a nucleic acid extract.

[0162] Suitable nucleic acid amplification techniques are well known to the skilled addressee, and include polymerase chain reaction (PCR) as for example described in Chapter 15 of Ausubel et al. supra, which is incorporated herein by reference; strand displacement amplification (SDA) as for example described in U.S. Pat. No. 5,422,252 which is incorporated herein by reference; rolling circle replication (RCR) as for example described in Liu et al., 1996, J. Am. Chem. Soc. 118 1587, International application WO 92/01813 and International Application WO 97/19193, which are incorporated herein by reference; nucleic acid sequence-based amplification (NASBA) as for example described by Sooknanan et al., 1994, Biotechniques 17 1077 which is incorporated herein by reference; ligase chain reaction (LCR) as for example described in International Application WO89/09385 which is incorporated by reference herein; and Q-β replicase amplification as for example described by Tyagi et al., 1996, Proc. Natl. Acad. Sci. USA 93 5395, which is incorporated herein by reference.

[0163] As used herein, an “amplification product” refers to a nucleic acid product generated by nucleic acid amplification techniques.

[0164] The nucleic acid extract may be an extract obtained from cells, tissues or biological fluids in the form of mRNA, or as cDNA reverse transcribed therefrom. The extract may be in the form of a cDNA or genomic library. In this regard, the cDNA or genomic library is preferably derived from a eukaryote, and advantageously from mammals such as humans. Such libraries may comprise genomic DNA or cDNA ligated into vectors such as will be described hereinafter.

[0165] Expression Constructs

[0166] The invention provides an expression construct which comprises an isolated Crim1 nucleic acid or homolog thereof, operably linked to one or more regulatory sequences in an expression vector.

[0167] An example of an expression construct of the invention is pcDNA3-hCRIM1myc deposited under accession number NM00/16530 at AGAL on Nov. 9, 2000.

[0168] Regulatory nucleotide sequences may be present in the expression vector (such as an enhancer, promoter, splice donor/acceptor signals, terminator and polyadenylation sequences) to facilitate expression of the polypeptide of the invention. Selectable markers are also useful whether for the purposes of selection of transformed bacteria (such as bla, kanR and tetR) or transformed mammalian cells (such as hygromycin, G418 and puromycin).

[0169] Both constitutive and inducible promoters may be useful for expression of CRIM1 polypeptides according to the invention. Examples of inducible promoters are metallothionein-inducible and tetracycline-repressible systems as are well known in the art.

[0170] An expression construct may also include a fusion partner sequence as hereinbefore defined (such as myc or GFP) so that the recombinant polypeptide of the invention is expressed as a fusion polypeptide with said fusion partner.

[0171] Suitable host cells for Crim1/CRIM1 expression include bacteria (e.g. E. coli DH5α), yeast, insect cells (e.g. Sf9), Xenopus oocytes and mammalian cells such as CHO and COS lines.

[0172] Expression constructs also include gene therapy constructs, which employ specialized gene therapy vectors such as vaccinia, and viral vectors useful in gene therapy. The latter include adenovirus and adeno-associated viruses (AAV) such as described in Franceschi et al., 2000, J. Cell Biochem. 78 476 and Braun-Falco et al., 1999, Gene Ther. 6 432, retroviral and lentiviral vectors such as described in Buchschacher et al., 2000, Blood 95 2499, and vectors derived from herpes simplex virus and cytomegalovirus. A general review of gene therapy vectors may be found in Robbins et al., 1998, Trends in Biotech. 16 35. An overview of viral vectors useful in endocrine gene therapy is provided in Stone et al., 2000, J. Endocrinol. 164 103.

[0173] If “anti-sense” therapy is contemplated, then one or more selected portions of said Crim1 nucleic acid may be oriented 3′→5′ in the gene therapy vector.
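
To illustrate the reversed orientation, the following Python sketch computes the reverse complement of a DNA sequence, on the assumption that a portion inserted in the reverse orientation is read from the opposite strand and so gives rise to a transcript complementary to the sense sequence. The function name and the six-base example are hypothetical and are not taken from any SEQ ID NO herein.

```python
# Translation table mapping each base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    sense = "ATGGCC"                      # hypothetical sense-strand portion
    antisense = reverse_complement(sense)
    print(antisense)
    # Applying the operation twice recovers the original sense sequence.
    print(reverse_complement(antisense) == sense)
```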

[0174] Pharmaceutical Compositions

[0175] The invention provides pharmaceutical compositions comprising a CRIM1 polypeptide or a Crim1 nucleic acid and a pharmaceutically-acceptable carrier, diluent or excipient. Also contemplated are pharmaceutical compositions comprising a CRIM1 mimetic or homologs of Crim1 nucleic acids and encoded polypeptides.

[0176] Pharmaceutical compositions comprising a Crim1 nucleic acid are preferably in the form of a gene therapy construct.

[0177] Such pharmaceutical compositions may be useful in prophylactic or therapeutic treatments for diseases including a neurodegenerative disease such as motor neuron disease, diseases of the eye or more particularly the eye lens, such as glaucoma, cataracts, microphthalmia, microcornea or nuclear sclerosis of the lens. Other possible diseases may relate to heart development, kidney and gonad development, tooth development and bone morphogenesis and healing of wounds and damaged skin.

[0178] “Pharmaceutically-acceptable carriers, diluents and excipients” include a solid or liquid filler, diluent or encapsulating substance which may be safely used in systemic administration. Depending upon the particular route of administration, a variety of pharmaceutically-acceptable carriers, diluents or excipients, well known in the art may be used. These may be selected from a group including sugars, starches, cellulose and its derivatives, malt, gelatine, talc, calcium sulfate, vegetable oils, synthetic oils, polyols, alginic acid, phosphate buffered solutions, emulsifiers, isotonic saline, and pyrogen-free water.

[0179] Dosage forms include tablets, dispersions, suspensions, injections, solutions, syrups, troches, capsules, suppositories, topically administered powders, aerosols and emulsions, transdermal patches, gels, pastes and the like. These dosage forms may also include controlled release devices or other forms of implants modified to act in this fashion. Controlled release of the therapeutic agent may be effected by coating the same, for example, with hydrophobic polymers including acrylic resins, waxes, higher aliphatic alcohols, polylactic and polyglycolic acids and certain cellulose derivatives such as hydroxypropylmethyl cellulose. In addition, the controlled release may be effected by using other polymer matrices, liposomes and/or microspheres.

[0180] Compositions of the present invention may be suitable for administration orally or by injection, and in such cases may be presented as discrete units such as capsules, sachets or tablets, as a powder or granules, or as a solution or a suspension in an aqueous liquid, a non-aqueous liquid, an oil-in-water emulsion or a water-in-oil liquid emulsion.

[0181] Administration of the gene therapy construct to an animal, preferably a human individual, may include delivery via direct oral intake, systemic injection, or delivery to selected tissue(s) or cells, or indirectly via delivery to cells isolated from the mammal or a compatible donor. An example of the latter approach would be stem-cell therapy, wherein isolated stem cells having potential for growth and differentiation are transfected with a gene therapy construct which includes a Crim1 nucleic acid or homolog. The stem-cells are cultured for a period and then transferred to the animal being treated.

[0182] Delivery of said gene therapy construct to cells or tissues of said mammal or said compatible donor may be facilitated by microprojectile bombardment, liposome mediated transfection (e.g. lipofectin or lipofectamine), electroporation, calcium phosphate or DEAE-dextran-mediated transfection, for example. A discussion of suitable delivery methods may be found in Chapter 9 of CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (Eds. Ausubel et al.; John Wiley & Sons Inc., 1997 Edition), for example, which is herein incorporated by reference.

[0183] Therapeutic Uses

[0184] Conservation of CRIM1/Crim1 and its expression during embryonic development suggests that CRIM1/Crim1 may be important for normal development of vertebrates. For example, mutations of this gene may be involved in human disease. Based on data which will be provided in more detail hereinafter, CRIM1 polypeptide and Crim1 nucleic acids may be useful in treating diseases including neurodegenerative disease such as motor neuron disease, diseases of the eye or more particularly the eye lens, such as glaucoma, cataracts, microphthalmia, microcornea or nuclear sclerosis of the lens. Other possible diseases may relate to heart development, kidney and gonad development, tooth development and/or bone morphogenesis and healing of wounds and damaged skin.

[0185] Members of the TGFβ superfamily, including the bone morphogenetic proteins (BMPs), are expressed during, and are critical for, early embryo development and organogenesis of a variety of organs, including the kidney, eye (cornea and lens), heart, skeleton, tooth, limb and central nervous system. By interacting with members of the BMP, GDF, activin or TGFβ family, CRIM1 may have a role in development, remodelling or repair of these various organs. More specifically, CRIM1 may be able to modulate activities such as bone remodelling, tissue regeneration and motor neuron specification. The genes may be useful in the diagnosis of diseases of these organs, and the proteins, or the genes encoding them in some form of gene therapy construct, may be useful in the treatment of such conditions.

[0186] Administration of pharmaceutical compositions comprising CRIM1 polypeptide or Crim1 nucleic acid may act to modulate the activity of BMP family molecules and TGFβ family members, and other molecules such as IGFs.

[0187] Chordins are another family of proteins which bind BMP family members. Referring to U.S. Pat. No. 5,846,770, U.S. Pat. No. 5,679,783 and U.S. Pat. No. 5,986,056 (each of which is incorporated herein by reference), it is clear that human chordin, for example, may inhibit or stimulate BMPs and thereby display a number of therapeutic effects.

[0188] Therefore, like chordin, it is expected that administration of CRIM1/Crim1, alone or together with other therapeutic proteins or nucleic acids encoding same, may have a variety of therapeutic effects. Examples of other therapeutic proteins contemplated by the present invention include fibroblast growth factor (FGF), activins, inhibins, insulin, insulin-like growth factor (IGF) and epidermal growth factor (EGF). In this regard, it is also noted that the presence of an IGF-binding domain in CRIM1 suggests that CRIM1 may be useful in treating IGF- and/or insulin-related conditions such as insulin-dependent diabetes, skin damage such as ulcers, burns and abrasions, in wound healing and related tissue repair.

[0189] CRIM1 is strongly expressed in developing lens. The lens expression persists postnatally but becomes restricted to the epithelial cells at the front of the lens. These cells must be actively maintained in a single layer of epithelium to prevent the obstruction of vision. Disturbances to these cells result in anterior cataract and after cataract. CRIM1 is likely to play a role in maintaining this layer of cells as an epithelium. Hence, CRIM1 may act in an anti-cataractogenic fashion. Significant evidence exists to suggest that within the eye, TGFβ, particularly TGFβ2, can act in a cataractogenic fashion, leading to disruption in the morphology of the epithelial cells covering the front of the lens. The result of addition of recombinant TGFβ2 to lens explant cultures is the production of plaques identical in histology to those seen arising after surgery for the removal of cataracts. These anterior cataracts are referred to as ‘after cataract’. The suggestion that inhibitors of TGFβ will act in an anti-cataractogenic fashion is supported by the disclosures of International Publication WO95/13827 and International Application WO98/26784, each of which is incorporated herein by reference.

[0190] By inhibiting the activity of TGFβ on the lens epithelial cells, CRIM1 may have considerable utility as an anti-cataractogenic agent. This may be developed as a gel or infusion for insertion into the lens capsule at the time of lens replacement operations to protect from after cataract. It may also be deliverable via the aqueous humor to prevent the onset of anterior cataract. In either of these situations, CRIM1 function may be able to be mimicked by a peptide similarly antagonising TGFβ. Such a peptidomimetic may be designed from further analysis of the CRIM1-TGFβ interaction.

[0191] Several of the BMPs have the property of generating bone, as suggested by their name. These include BMP2 and 7. BMP7, also called Osteogenic Protein 1 (OP-1), is already known to have potential in bone remodelling, including the treatment of periodontal and orthopedic indications such as fractures. The current approach of mixing OP-1 and a purified collagen matrix into a paste that is applied during surgery may also be applicable to CRIM1. If CRIM1 facilitates the activity of BMP7, this would make it an important accessory for OP-1 treatments. OP-1, while well-established for inducing orthotopic and ectopic bone formation may suffer from limited clinical usefulness as a regenerative agent due to a short in vivo half-life and low specific activity (Franceschi et al, 2000, supra). If CRIM1 stabilises or facilitates an increased BMP half-life, inclusion of this protein in preparations of OP-1 may increase the duration and specificity of the effect. With regard to OP-1 and suitable formulations and delivery of same which may be applicable to the present invention, the skilled person is referred to U.S. Pat. No. 4,968,590, U.S. Pat. No. 5,597,897, U.S. Pat. No. 5,258,494 and U.S. Pat. No. 5,266,683, each of which is incorporated herein by reference.

[0192] Several of the BMPs are expressed strongly during the development of the kidney, including BMP2, 4, 5 and 7. A recent review of the field relating to BMPs and kidney development is provided in Godin et al., 1999, Int. J. Dev. Biol. 43 405, which is incorporated herein by reference.

[0193] More particularly, a knockout mouse model of BMP7 −/− reveals significant eye and kidney defects suggesting an important role for BMP7 in the formation of these organs. The kidney defects include renal dysgenesis, cystic kidneys or agenesis. BMP7 is expressed in both the ureteric epithelium and the mesenchyme throughout embryonic development and has been shown to function as a survival factor for the nephrogenic mesenchyme. However, at high concentrations, BMP7 appears to also function as an anti-differentiation factor for the metanephric mesenchyme. CRIM1 shows overlapping expression patterns with BMP2 and BMP7, particularly during the formation of the pretubular aggregates and the comma-shaped bodies, which go on to form the proximal portion of the nephrons of the kidney. The data presented hereinafter suggesting that the presence of a BMP can lead to the liberation of secreted CRIM1 protein may suggest that CRIM1 modulates the roles of BMPs in nephron formation. Stimulation of the anti-differentiative activity of BMP7 during kidney development may make CRIM1 an important protein in the growth and expansion of metanephric mesenchymal populations. The ability to derive and expand such a population of mesenchyme fated for kidney development will be critical in the development of kidney regeneration technologies.

[0194] BMP7 expression in the kidney continues after birth, as does expression of receptors for BMP on the podocytes within the glomeruli. Application of BMP7 has been shown to decrease the loss of kidney function associated with acute ischaemic injury (Vukicevic et al., 1998, J. Clin. Invest. 102 202). It can also inhibit tubulointerstitial fibrosis and inflammation after unilateral ureteral obstruction. CRIM1 may similarly assist in such conditions by facilitating or increasing the duration of such BMP7 activity. BMP7 (OP-1) has been found to be preventative for renal fibrosis associated with ureteral obstruction. OP-1 administration can prevent tubular atrophy and diminish the activation of tubulointerstitial inflammation and fibrosis, thereby preserving renal function (Hruska et al., 2000, Am. J. Renal. Physiol. 279 130).

[0195] By augmenting or potentiating OP-1 activity, CRIM1 is a candidate therapeutic adjuvant for treatment during ureteral obstruction to maintain renal function. Alternatively, if CRIM1 inhibits OP-1 activity, a CRIM1 mimetic could block the interaction between CRIM1 and OP-1 with therapeutically useful effects.

[0196] Within the kidney, TGFβ has been implicated in vascular remodelling, premature termination of normal nephrogenesis, promotion of a transition of epithelial cells to mesenchymal cells and a variety of other effects. Increases in circulating TGFβ1 occur during diabetes. This may contribute to the onset of diabetic nephropathy via the induction of collagens 3 and 1 which result in scarring and fibrosis within the kidney. By acting as an inhibitor of this process, CRIM1 may be useful as a preventative therapy for diabetic patients. This is a very large issue for indigenous populations worldwide, including the Pima Indians, Inuits, African Americans and Australian Aboriginals who have high rates of diabetes, high rates of circulating TGFβ1 and high rates of end stage renal disease.

[0197] The human condition Alport syndrome results from defects in collagen IV resulting in damage to the glomerular basement membrane and subsequent renal failure. Recent data in mice have shown that in mouse models of this disease, inhibition of TGFβ1 ameliorates the focal thickening of the basement membrane characteristic of Alport syndrome (Cosgrove et al., 2000, Am. J. Pathol. 157 1649). The present invention therefore contemplates CRIM1 acting as an inhibitor of TGFβ1 and thereby being therapeutically useful in treatment of Alport syndrome. It is also contemplated that if CRIM1 potentiates or augments TGFβ1 activity, CRIM1 mimetics may be useful in treating Alport syndrome.

[0198] Recent data using human stem cells has revealed that these cells are extremely difficult to maintain in an undifferentiated state, hampering efforts at organ regeneration. One of the genes turned on as these cells start to differentiate is BMP4. Human pluripotent teratocarcinoma stem cell lines have been investigated as a model of human stem cells and show the expression of the stem cell marker Oct4 until treated with BMP2, upon which they differentiate into endodermal precursors (Pera & Herszfeld, 1998, Reprod. Fertil. Dev. 10 551). By inhibiting BMP2 function, CRIM1 may act to maintain human stem cells in an undifferentiated state, which would be very useful in the expansion of such cells as a potentially unlimited source of many different cell types for cell-based gene and tissue therapies. Conversely, CRIM1 may work in concert with proteins such as BMP2 for the selective reprogramming of such cells for a particular lineage.

[0199] Embryonic expression of the Crim1 gene occurs in the notochord and floor plate, which are known to be the source of the embryonic organising centre for the developing central nervous system (CNS). The conservation of the nucleotide and protein sequences of the CRIM1 homologues in human, mouse, chick and C. elegans suggests an essential function for CRIM1 that has been conserved during animal evolution. These findings suggest that CRIM1 functions as part of a signalling mechanism which is required for normal CNS development. This may also be important in tissue regeneration including kidney replacement therapy.

[0200] CRIM1 protein, nucleic acids encoding said protein, and interacting proteins which selectively bind such proteins may function as a regulator for normal neuronal differentiation in the spinal cord, and migration of neural crest-derived cells, by either direct or indirect interactions with other growth factors such as BMPs, TGFβs and IGFs that are thought to be involved in the normal and/or abnormal neuronal differentiation in mammalian CNS. CRIM1 may also function as a neural cell adhesion molecule that is required for the normal development and maintenance of neurons in the CNS during normal embryonic development and in the adult. CRIM1 may also promote development of neuronal processes such as axons in developing CNS.

[0201] CRIM1 protein, or nucleic acids encoding CRIM1 and interacting proteins which selectively bind such proteins will also find use in screening chemical libraries for regulators of neural differentiation, cell migration, adhesion and neuronal process growth, in genetic mapping, as probes for related genes, as diagnostic reagents for genetic neurological disease and in the production of specific cellular and animal systems for the development of neurological disease therapy, particularly for conditions such as motor neuron disease. They may also be important in the derivation and in vitro culture of neural stem cells for stem cell therapy of neurological conditions.

[0202] Diagnostic Methods

[0203] The invention provides use of Crim1 nucleic acids and CRIM1 polypeptides for diagnostic purposes.

[0204] It will be appreciated that the Crim1 nucleic acids of the present invention provide useful reagents for chromosome tagging and localization, such as in human genetic mapping studies. As will be described hereinafter, the isolated human Crim1 nucleic acid corresponds to a gene located on chromosome 2p21-16.3. It is noted that heritable diseases such as spastic paraplegia (SPG4; Hazan et al., 1994, Hum. Mol. Genet. 3 1569) and holoprosencephaly (Schell et al., 1996, Hum. Mol. Genet. 5 223) map close to this chromosomal region in humans (Spastin and SIX3 respectively). Crim1 nucleic acids may therefore be useful in mapping and isolating hitherto unknown genes in this chromosomal region underlying other diseases. Gene mapping techniques are well known in the art, and a recent review of techniques such as linkage analysis, SNP analysis and uniparental disomy is provided in Vnencak-Jones, 1999, Am. J. Clin. Pathol. 112 S19, which is incorporated herein by reference.

[0205] Furthermore, the specification hereinafter provides a variety of methods utilizing isolated Crim1 nucleic acids for analysis of cell and tissue development, which methods may be useful in diagnostic, forensic and general tissue-typing applications.

[0206] For example, the invention provides a method of determining whether an animal is predisposed to a genetically-heritable disease, which method includes the steps of:

[0207] (i) obtaining a nucleic acid sample from said animal; and

[0208] (ii) determining whether said nucleic acid sample includes a Crim1 nucleic acid mutation or polymorphism indicative of said animal being predisposed to, or suffering from, said genetically-heritable disease.

[0209] It will be appreciated that according to this aspect, the Crim1 nucleic acid may be used as a basis for designing PCR primers, sequencing primers or hybridization probes to assist determination of whether said Crim1 nucleic acid contains a mutation or polymorphism indicative of said individual being predisposed to said disease.

[0210] As used herein “predisposed” refers to said individual having an increased likelihood of displaying disease symptoms, or being a carrier of a predisposing mutation or polymorphism.

[0211] Preferably, said genetically-heritable disease may be a neurodegenerative disease such as motor neuron disease, a disease of the eye or more particularly the eye lens, such as glaucoma, cataracts, microphthalmia, microcornea or nuclear sclerosis of the lens. Other possible diseases may relate to heart development, tooth development or bone morphogenesis.

[0212] Suitably, the nucleic acid sample is genomic DNA, cDNA or mRNA. Preferably, the nucleic acid sample is genomic DNA.

[0213] Preferably, step (ii) employs a nucleic acid amplification technique such as PCR.

[0214] Analysis of said amplification products may be according to relative size, in which case high resolution gel electrophoresis or capillary electrophoresis are applicable. Analysis may also be achieved by nucleotide sequencing of said amplification products.

[0215] Other approaches are relevant to full mutation analysis of Crim1, for example using single stranded conformation polymorphism (SSCP) analyses. This would involve the extraction of genomic DNA from said individual and PCR amplification of each Crim1 gene exon. Polyacrylamide gel electrophoresis under specific denaturing conditions would reveal any amplified alleles which migrate differently to wildtype DNA. These can then be directly sequenced to identify the mutation.
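As a minimal illustration of the final step described above, a sequenced variant allele can be compared base by base with the wild-type exon sequence. The Python sketch below assumes two already-aligned sequences of equal length, so it reports substitutions only. The function name and the example sequences are hypothetical and are not taken from the Crim1 sequence.

```python
def find_substitutions(wild_type: str, sample: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, wild-type base, sample base) for each mismatch.

    Assumes the two sequences are already aligned and of equal length,
    so insertions and deletions are not handled.
    """
    if len(wild_type) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    wt, sm = wild_type.upper(), sample.upper()
    return [(i + 1, a, b) for i, (a, b) in enumerate(zip(wt, sm)) if a != b]


# Hypothetical exon fragments, for illustration only.
WILD_TYPE = "ATGGCCTGTGAATGC"
SAMPLE = "ATGGCCTGTGGATGC"
substitutions = find_substitutions(WILD_TYPE, SAMPLE)
```

In practice an alignment step would precede such a comparison so that insertions and deletions are also detected.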

[0216] It will also be appreciated that detection of said mutation or polymorphism may be achieved by a variety of other techniques. These include Denaturing Gradient Gel Electrophoresis (DGGE), an example of which is provided in Fodde & Losekoot, 1994, Hum. Mutat. 3 83 (which is herein incorporated by reference), melt curve analysis, an example of which is provided in International Publication No. WO97/46714 (which is herein incorporated by reference), or Restriction Fragment Length Polymorphism (RFLP) analysis of said amplification products.
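By way of a hedged illustration of RFLP analysis of an amplification product, the following Python sketch predicts the fragment lengths obtained when a linear amplicon is cut at every occurrence of a restriction site. The amplicon sequences are hypothetical; EcoRI (G^AATTC) is used only as a familiar example enzyme and is not suggested by the specification as diagnostic for any Crim1 polymorphism.

```python
def digest(sequence: str, site: str, cut_offset: int) -> list[int]:
    """Return fragment lengths after cutting a linear sequence at every site.

    cut_offset is the number of bases from the start of the recognition
    site to the cut position on the top strand (1 for EcoRI, G^AATTC).
    """
    seq = sequence.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


# Hypothetical amplicons: the variant carries a single base change
# that abolishes the EcoRI site present in the wild-type amplicon.
WILD_TYPE_AMPLICON = "CCTTGGAATTCAGGTTCCAA"
VARIANT_AMPLICON = "CCTTGGAGTTCAGGTTCCAA"

wild_type_fragments = digest(WILD_TYPE_AMPLICON, "GAATTC", 1)
variant_fragments = digest(VARIANT_AMPLICON, "GAATTC", 1)
```

A difference between the two fragment patterns is what would be resolved by gel electrophoresis.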

[0217] So that the invention can be understood in more detail, the skilled person is directed to the following non-limiting examples.

EXAMPLE 1

[0218] Isolation of Human CRIM1 (S52) cDNA

[0219] 1.1 Two-Hybrid Screening

[0220] Yeast Transformation

[0221] The library used for two-hybrid analysis was a 19-23 week human fetal kidney cDNA “MATCHMAKER” library purchased from Clontech (USA). cDNA inserts were cloned into the pGAD10 plasmid so that encoded protein would be expressed as LexA-AD fusion proteins.

[0222] The library was transformed into the yeast Saccharomyces cerevisiae L40 strain (4LexAop-HIS3; 8LexAop-lacZ) which had been previously transformed with the plasmid pBTMWT1D encoding the WT1D-LexA (DBD) fusion protein “bait”.

[0223] HIS3 Reporter Activation and Growth Selection

[0224] Activation of the HIS3 reporter gene was determined qualitatively by growth on SD plates lacking uracil, lysine, tryptophan, leucine, and histidine [SD(THULL)] after incubation for 3 days at 30° C. The relative strength of interaction was determined by replica plating colonies on plates containing 3-amino-triazole (3AT; Sigma Chemical Company). The range of 3AT concentration typically used was between 0.2 and 5.0 mM in SD(THULL) plates.

[0225] lacZ Reporter Activity

[0226] β-galactosidase levels were determined qualitatively by a filter assay or in a liquid assay. For filter assays, colonies were grown as an X-shape (to increase colony mass) on replica SD(THULL) plates until thick (3-4 days). A sterile circle of 3M paper cut to the size of the 10 cm plate was placed onto one plate of the colonies and pressed firmly across the surface of the plate to encourage even adherence of the colonies to the paper. The paper was then peeled from the plate and put through two cycles of freezing and thawing in liquid N₂. The paper was then overlaid onto a pad of three similar paper circles pre-wet in Buffer Z with X-gal and incubated in a 10 cm plastic petri dish at 30° C. Incubations were allowed to continue overnight if necessary. Liquid assays were carried out by a standard assay method.

[0227] Plasmid Recovery

[0228] Cells from 1 ml of overnight culture were pelleted and resuspended in 0.5 ml of S Buffer (10 mM KPO₄ pH 7.2, 10 mM EDTA, 50 mM 2-ME, 50 μg/ml Zymolyase) and incubated at 37° C. for 30 minutes. 0.1 ml of Lysing Solution (0.25 M Tris-HCl pH 7.5, 25 mM EDTA, 2.5% SDS) was then added and the solution was vortexed and incubated at 65° C. for 30 minutes. 166 μl of 3 M potassium acetate was added and the solution was chilled on ice for 10 minutes before centrifugation for 10 minutes. The supernatant was transferred to a fresh tube and 0.8 ml of cold ethanol was added. After 10 minutes on ice the solution was centrifuged for 10 minutes and the supernatant discarded. The pellet was then washed in 70% ethanol, air-dried, and resuspended in 40 μl sterile H₂O. This method was used to recover the library “prey” plasmids from colonies that contained both HIS3 and lacZ activity. Plasmids were recovered by transformation of the yeast miniprep into E. coli HB101 electrocompetent cells and selection on M9 minimal media plates. Bacteria not transformed with the trp+ “bait” plasmid will not grow on this medium.

[0229] 1.2 Library Screening and cDNA Sequencing

[0230] An insert corresponding to an S52 partial cDNA clone was isolated, and subjected to restriction endonuclease digestion with SacI and BamHI restriction enzymes to yield four sub-fragments of 900 bp, 800 bp, 600 bp and 300 bp respectively. These sub-fragments were ligated into pBluescript KSII- (Stratagene) and transformed into DH5α E. coli for sequencing. The 300 bp sub-fragment was labeled with ³²P-dCTP by random priming and used to screen a human fetal brain cDNA library in λgt10. Positive plaques identified by plaque hybridization were picked, phage isolated and inserts subcloned into pBluescript for further nucleotide sequencing. Initial sequencing was performed using standard reverse, forward, T3 or T7 primers as applicable to pBluescript. As further sequence was obtained, this was used to design additional sequencing primers.

EXAMPLE 2 Isolation of Mouse Crim1 (S52) cDNA

[0231] 2.1 Preparation of DNA Probes

[0232] The probe used for primary library screening was derived from a PCR product of Mouse EST 551975 (Genome Systems) containing the predicted 3′ end of the mouse S52 cDNA. PCR was performed as above using Mouse S52 3′ 3F (5′ GCT CAG CAC CCC TTC TAT TTG C 3′; SEQ ID NO: 10) and Mouse S52 3′ 3R (5′ GTG ATG AGT CTC GCC TGG ATG 3′; SEQ ID NO: 11) primers at an annealing temperature of 57° C. The product was cloned into pGEM-T easy (Promega). The 5′ end sequence of the human S52 gene was used as a probe for secondary library screening of primary phage positives. For this purpose, an 800 bp fragment of a human S52 cDNA clone was excised with SacI and BamHI and purified using the Agarose gel extraction kit (Boehringer-Mannheim).
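For orientation only, basic properties of the two primers recited above (SEQ ID NOS: 10 and 11) can be tabulated with a short Python sketch. The melting temperature shown uses the Wallace rule of thumb (2° C. per A/T, 4° C. per G/C), which is a rough estimate best suited to short oligonucleotides; it is an assumption of this sketch and not the method by which the 57° C. annealing temperature was chosen.

```python
def clean(primer: str) -> str:
    """Strip the triplet spacing used in the text and normalise case."""
    return primer.replace(" ", "").upper()


def gc_content(primer: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(primer)
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(primer: str) -> int:
    """Rule-of-thumb melting temperature: 2 per A/T plus 4 per G/C (deg C)."""
    seq = clean(primer)
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


PRIMERS = {
    "Mouse S52 3' 3F (SEQ ID NO: 10)": "GCT CAG CAC CCC TTC TAT TTG C",
    "Mouse S52 3' 3R (SEQ ID NO: 11)": "GTG ATG AGT CTC GCC TGG ATG",
}

for name, sequence in PRIMERS.items():
    print(name, len(clean(sequence)), round(gc_content(sequence), 2), wallace_tm(sequence))
```

Nearest-neighbour thermodynamic models would normally give a more reliable estimate for primers of this length.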

[0233] Radiolabeling of DNA probes with [α-³²P] dCTP was performed using the Redi Prime-II labelling kit (Amersham Life Science).

[0234] 2.2 Library Screening

[0235] Radiolabeled probes were used to screen an E11.5 random primed whole mouse cDNA library cloned into λgt10. Briefly, duplicate Hybond N filters were lifted from each plate, air-dried for 30 minutes and cross-linked using a GS gene linker (BioRad). Filters were pre-hybridised (3 hrs) and then hybridised with 200 ng of radioactively labeled probe (16 hrs) in Church and Gilbert buffer (0.263 M sodium phosphate buffer, pH 7.2; 1 mM EDTA; 7% SDS; 1% BSA (Boehringer-Mannheim)) at 65° C. Stringency washes were performed with SSC/SDS wash solutions, down to 0.2×SSC, 0.1% SDS (Sambrook et al., 1989, supra). Hybridized filters were exposed to X-ray film (Fuji) at −70° C. with intensifying screens.

[0236] 2.3 Cloning and Sequencing of Phage Inserts

[0237] Plaques identified via probe hybridization on duplicate filters were isolated and replated (Sambrook et al., 1989, supra) prior to subcloning cDNA inserts into pBluescript KS⁺. Sequencing was performed from both ends of the pBluescript KS⁺ vector using T7 and T3 primers. Additional primers were designed as sequence information was obtained.

[0238] 2.4 5′ RACE

[0239] Amplification of the 5′ end of the mouse S52 cDNA was performed using 5′ RACE with primers specific to the 5′ end of a mouse cDNA obtained from cDNA library screening. For 5′ RACE, four primers were utilized, three of which were nested primers, and the other a forward control. MS52 GSP1 (5′ GGA ATC TTC AGG GCA ACG 3′; SEQ ID NO: 12) was used for cDNA synthesis, and MS52 GSP2 (5′ CAC AGC GGG CCT TGC TGC AAT C 3′; SEQ ID NO: 13) and MS52 GSP3 (5′ GCC GGA GAT GAG GTT TTC ATT G 3′; SEQ ID NO: 14) were then used for PCR amplification. MS52 RACE F (5′ CCG CCA GAG GAA CGA GAG CTG 3′; SEQ ID NO: 15) was used in conjunction with MS52 GSP3 to test for the presence of S52 cDNA at each step of the 5′ RACE protocol by PCR. Template RNA was extracted from homogenized tissue (E11.5 whole mouse embryo) using the guanidinium thiocyanate (GTC), phenol and chloroform method (Sambrook et al., 1989, supra). 5′ RACE was performed using the 5′ RACE system for rapid amplification of cDNA ends kit (GibcoBRL). 5′ RACE products were purified using PCR spinclean kit (Progen), and ligated into pGEM-T easy cloning vector as described above. The ligation was transformed into E. coli cells, grown and plasmid DNA was isolated. The resulting clones were sequenced from both ends using T7 and SP6 primers.

EXAMPLE 3 DNA Sequencing

[0240] Sequencing was carried out using the ABI PRISM™ BigDYE™ terminator sequencing ready reaction kit (ABI). For the sequencing reaction, 0.3-0.5 μg of double stranded plasmid DNA template and 3.2 pmole of primer were added to 8 μl of ABI terminator ready reaction mix and the volume made up to 20 μl with distilled water. The reaction was overlaid with mineral oil and incubated in a Perkin Elmer Cetus thermocycler at 96° C. for 30 seconds, 50° C. for 15 seconds and 60° C. for 4 mins for a total of 25 cycles. DNA was purified from the reactions by ethanol precipitation. Gel separation and raw sequence data analysis was performed through the DNA sequencing service at the Australian Genome Research Facility.

EXAMPLE 4 Amplification of DNA by the Polymerase Chain Reaction

[0241] Polymerase chain reaction (PCR) was used for probe preparation, for radioactive labeling, 5′ RACE and chromosomal localization experiments. Unless otherwise stated, PCR was performed using the following reagents: 2.0 μl of Taq DNA polymerase buffer (Boehringer-Mannheim), 2 μl of 2.5 mM dNTPs, 0.8 μl of 50 μM primer (forward and reverse), 0.2 μl of 5 U/μl Taq DNA polymerase (Boehringer-Mannheim), 1-2 μl DNA template (various concentrations dependent on template type) and water to a final volume of 20 μl. Unless elsewhere stated the reactions were incubated in a MJ Research DNA engine thermocycler, with an initial denaturing step at 94° C. for 3 mins, then 35 cycles of 94° C. for 1 min; 50-60° C. (dependent on primer) for 45 seconds and 72° C. for 1 min, with a final extension step of 72° C. for 1 minute.
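The stock volumes recited above imply the following final concentrations in the 20 μl reaction, which can be checked with a short Python sketch. The sketch assumes that 0.8 μl of each primer (forward and reverse) is added, which is one reading of the text; the function and variable names are illustrative only.

```python
REACTION_VOLUME_UL = 20.0


def final_concentration(stock: float, volume_ul: float,
                        total_ul: float = REACTION_VOLUME_UL) -> float:
    """Dilute a stock (any concentration unit) into the reaction volume."""
    return stock * volume_ul / total_ul


# Values taken from the reagent list; units follow the stock solutions.
dntp_mM = final_concentration(2.5, 2.0)          # 2 ul of 2.5 mM dNTPs
primer_uM = final_concentration(50.0, 0.8)       # 0.8 ul of 50 uM primer (assumed: each primer)
taq_units = 5.0 * 0.2                            # 0.2 ul of 5 U/ul Taq polymerase

print(f"dNTPs: {dntp_mM:.2f} mM, each primer: {primer_uM:.1f} uM, Taq: {taq_units:.1f} U")
```

The same helper can be reused for any other component whose stock concentration and added volume are known.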

EXAMPLE 5 Northern Hybridization

[0242] 5.1 Whole Mouse mRNA Embryonic Northern

[0243] A mouse embryonic northern filter containing ES cell and whole mouse embryo RNA (E11.5-E15, and E17.5) was probed with a radioactively labeled PCR fragment generated from MS52 3F and MS52 3R primers. The filter was washed to a stringency of 2×SSC, 0.1% SDS at 65° C., and exposed to X-ray film for 5 days at −70° C. A control using a glyceraldehyde-3-phosphate dehydrogenase (GAPDH) probe had previously been performed using the same filter, showing equal loading in all lanes.

[0244] 5.2 Human Multiple Tissue Northern

[0245] A human multiple tissue northern containing mRNA from adult tissues (CLONTECH, catalog # 7760-1) was probed with human S52 (800 bp SacI-BamHI fragment) at 65° C. This filter was washed to a stringency of 1×SSC, 0.1% SDS at 65° C., and exposed to X-ray film for 2 nights at −70° C. A control with GAPDH probe was used to reveal the mRNA present in each lane.

EXAMPLE 6 Whole Mount in situ Hybridization

[0246] 6.1 DIG-Labeled RNA Probe Preparation

[0247] Probe synthesis was carried out in a 20 μl reaction volume containing 9.5 μl of RNase-free distilled water, 4 μl of 5×transcription buffer (Promega), 4 μl of 0.1 M DTT (Promega), 2 μl of digoxigenin (DIG) nucleotide labelling mix (Boehringer-Mannheim), 1 μl of linearised template DNA, 0.5 μl of placental ribonuclease inhibitor (Promega) and 1 μl of T3 or T7 RNA polymerase (Promega). The reaction was incubated for 2 hours at 37° C. 2 μl of DNase-I (Promega) was added and the reaction incubated for a further 15 mins. The probes were purified using a G-50 sephadex DNA purification column (Boehringer-Mannheim) and stored at −20° C. Mouse S52 antisense probe (RNA that is complementary to, and will hybridize with, endogenous mRNA) was made by linearising MS52 clone 19 (from library screening) with SalI, and transcribing with T7 RNA polymerase. The sense probe (negative control) was made by cutting MS52 clone 19 with BamHI and transcribing with T3 RNA polymerase. Shh and Islet-1 radiolabelled probes were used as positive controls.

[0248] 6.2 Mouse Embryos

[0249] Pregnant female Quakenbush mice were obtained from the Central Animal Breeding House (University of Queensland). The age of embryos was determined by designating noon of the day of the seminal plug as E0.5. Also, stereotype limb shapes characteristic of each developmental stage were used as a guide after dissection of embryos. Embryos were obtained by dissecting the uterus and removing the uterine wall and extraembryonic membranes in ice cold PBS. The embryos were fixed in 4% paraformaldehyde (PFA) overnight and then dehydrated through a PBTX (10 mM phosphate buffered saline [PBS], 0.1% Triton X-100): methanol series (75% PBTX: 25% methanol; 50% PBTX: 50% methanol; 25% PBTX: 75% methanol) into 100% methanol and stored at −20° C. until required.
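The staging convention stated above (noon of the day of the seminal plug designated E0.5) can be expressed as a simple calculation. The following Python sketch is illustrative only; the function name and the dates used are hypothetical.

```python
from datetime import date, datetime, time


def embryonic_day(plug_date: date, harvest: datetime) -> float:
    """Embryonic age in days, counting noon on the day of the plug as E0.5."""
    plug_noon = datetime.combine(plug_date, time(12, 0))
    elapsed_days = (harvest - plug_noon).total_seconds() / 86400
    return 0.5 + elapsed_days


# Hypothetical dates: plug observed on 1 March, embryos collected at noon on 12 March.
age = embryonic_day(date(2000, 3, 1), datetime(2000, 3, 12, 12, 0))
print(f"E{age:.1f}")
```

As the paragraph notes, limb morphology would still be used to confirm the stage assigned by date.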

[0250] 6.3 Pre-Treatment of Embryos and Hybridization of DIG-Labeled RNA Probe

[0251] Embryos were rehydrated into PBTX by a reverse PBTX: methanol series, and treated with 10 μg/ml Proteinase K (Sigma) for 20 mins. After washing with PBTX twice for 10 mins, the embryos were refixed in 4% PFA, 0.2% glutaraldehyde (Sigma) in PBTX for 20 mins at 25° C. The embryos were incubated at 65° C. in pre-hybridization buffer (50% formamide, 5×SSC, 2% blocking powder [Boehringer-Mannheim], 0.1% Triton-X100, 0.5% CHAPS [Sigma], 1 mg/ml yeast RNA, 5 mM EDTA and 50 μg/ml heparin) for 3 hours with agitation. 1.0 μg DIG-labeled probe was then added and incubated overnight at 65° C.

[0252] 6.4 Post Hybridization Washes and Immunohistochemistry

[0253] Samples were washed at 65° C. for 5 mins in S1 (50% formamide, 5×SSC, 0.1% Triton-X100, 0.5% CHAPS). The samples were then washed sequentially in 75% S1: 25% 2×SSC; 50% S1: 50% 2×SSC; 25% S1: 75% 2×SSC for 5 mins at 65° C., and then 2 washes each of 2×SSC, 0.1% CHAPS and 0.2×SSC, 0.1% CHAPS for 30 mins at 65° C. The embryos were washed with TBTX (50 mM Tris.HCl [pH 7.5], 150 mM NaCl, 0.1% Triton-X100) twice at 25° C., and incubated in pre-block solution (10% sheep serum, 2% BSA in TBTX) at 25° C. The embryos were incubated overnight at 4° C. with pre-absorbed anti-DIG monoclonal antibody (Boehringer-Mannheim) diluted 1/2000 in pre-block solution. Preabsorbed antibody was prepared by incubating 1 μl of antibody (0.75 units/μl) in 0.5 ml of TBTX containing 3 mg mouse embryo powder, 10% sheep serum, 2% BSA at 4 24° C. for 3 hours; removing the embryo powder by centrifugation, and diluting the supernatant to 2 ml with TBTX containing 10% sheep serum and 2% BSA.
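The 1/2000 antibody dilution follows from the preabsorption volumes given above, as the following Python sketch shows. It assumes that the whole 0.5 ml supernatant is recovered after centrifugation; the variable names are illustrative.

```python
from fractions import Fraction

ANTIBODY_UL = 1          # volume of antibody stock added
PREABSORB_UL = 500       # 0.5 ml of TBTX with embryo powder
FINAL_UL = 2000          # supernatant made up to 2 ml
STOCK_UNITS_PER_UL = Fraction(3, 4)  # 0.75 units/ul

# Dilution after preabsorption, then after making the supernatant up to 2 ml
# (assumes the full 0.5 ml supernatant is recovered).
preabsorbed_dilution = Fraction(ANTIBODY_UL, PREABSORB_UL)
final_dilution = preabsorbed_dilution * Fraction(PREABSORB_UL, FINAL_UL)

# Antibody activity per ml of the final working solution.
units_per_ml = STOCK_UNITS_PER_UL * ANTIBODY_UL / Fraction(FINAL_UL, 1000)

print(final_dilution, float(units_per_ml))
```

Exact fractions are used so that the result can be compared directly with the dilution stated in the text.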

[0254] 6.5 Post Antibody Washes and Staining

[0255] The embryos were washed with 0.1% BSA in TBTX, five times for 1 hour each at room temperature, and then overnight at 4° C. The embryos were washed twice with TBTX and then three times with NTMT (100 mM NaCl, 100 mM Tris.HCl [pH 9.5], 50 mM MgCl₂, 0.1% Tween-20) at 25° C. The alkaline-phosphatase colouring reaction was carried out for 2-5 hours at 25° C. in the dark, with 0.338 μg/ml NBT (Boehringer-Mannheim), and 0.175 μg/ml BCIP (Boehringer-Mannheim). The reaction was terminated once sufficient colour was observed, by washing several times in NTMT and then PBTX at 25° C. Background labeling was removed by washing in PBS containing 1% Triton-X100. Embryos were fixed overnight in 4% PFA and stored in 50% glycerol:PBS.

EXAMPLE 7 Section in situ Hybridization

[0256] 7.1 Cryosectioning

[0257] Mouse embryos obtained as above were directly frozen in Tissue-tek (Sakura-Finetek) on dry ice, and stored at −20° C. Sections were cut using a Leica 3050 cryostat, with a section thickness of 14 μm. Sections were transferred immediately to pre-treated microscope slides (Superfrost) and air dried for several hours in a dust-free hood at 25° C.

[0258] 7.2 Tissue Preparation

[0259] Sections were fixed in 4% PFA for 10 mins, and then washed in PBS three times for 3 mins each. The sections were then incubated in acetylation mixture (1.33% triethanolamine [Fluka], 15 mM HCl, 0.25% Acetic anhydride [Fluka]) for 10 mins, and washed three times with PBS for 5 mins each at 25° C.

[0260] 7.3 Hybridization of DIG-Labeled Probe

[0261] 800 μl of hybridization mix (50% formamide, 5×SSC, 5×Denhardt's, 250 μg/ml baker's yeast RNA [Sigma], 500 μg/ml herring sperm DNA) was added to each slide and incubated in a 5×SSC humidified chamber for 2 hours. This was replaced with 75 μl hybridization solution containing 200 ng/ml DIG labeled RNA probe. Siliconised coverslips (dipped twice in 3% silane in chloroform and 100% ethanol, and air-dried) were placed onto slides, and incubated in a humidified (5×SSC, 50% formamide) chamber overnight at 60° C.

[0262] 7.4 Post Hybridization Washes and Immunohistochemistry

[0263] Coverslips were removed in 5×SSC, and then the sections were incubated in 0.2×SSC at 60° C. for one hour. After washing in 0.2×SSC at 25° C. for 5 mins, the slides were transferred into buffer B1 (0.1 M Tris pH 7.5, 0.15 M NaCl) for 5 mins at 25° C. 1 ml of B1 containing 10% sheep serum was placed on each slide, and incubated for 1 hour. 0.5 ml of anti-DIG antibody (1:5000 dilution in B1 with 0.1% sheep serum) was placed onto each slide and incubated overnight at 4° C. Slides were rinsed three times with B1, and equilibrated with Buffer B3 (0.1 M Tris pH 9.5, 0.1 M NaCl, 50 mM MgCl₂). The colouring reaction was performed by incubating the section with 0.338 μg/ml NBT and 0.175 μg/ml BCIP in buffer B3 for 2-5 hours. The reaction was terminated by rinsing several times with distilled water. Background staining was removed by washing with PBS containing 1% Triton-X100. The sections were fixed in 4% PFA and mounted using Mount Quick aqueous mounting medium (Daido Sangyo).

EXAMPLE 8 Photography

[0264] Photography of whole embryos was carried out on a Leica MZ8 dissecting stereo microscope with a Leica MPS48 exposure controller and camera. Photography of tissue sections was performed on an Olympus Provis AX70 microscope with an Olympus U-MCB exposure controller and camera, using Kodak Ektachrome 160T Pro film.

EXAMPLE 9 Chromosomal Localisation of Human S52 by Radiation Hybrid PCR Screening

[0265] 9.1 Primer Design and Control PCR

[0266] Radiation hybrid screening primers 3′ PCR 1F (5′ CTA CCA AAC AGT GTG AAG AAA 3′; SEQ ID NO: 16) and 3′ PCR 1R (5′ TGG TCA GTT ATC TTG AGG AA 3′; SEQ ID NO: 17) were designed using the sequence alignment of human and mouse S52 at a region which was thought to cover the 3′ untranslated region (UTR). These primers amplified a single band of 196 bp from human S52 cDNA and human genomic template DNA (Genebridge), but not from mouse cDNA (mouse EST #551975), mouse genomic DNA (Genebridge) or hamster genomic DNA (Genebridge), at 58° C. annealing temperature.

[0267] 9.2 Sample PCR Screening and Data Analysis

[0268] Screening of 93 samples comprising a human:hamster hybrid genomic DNA panel (Gyapay et al., 1996, Hum. Mol. Genet. 5 339) was performed by PCR with 3′ PCR 1F and 3′ PCR 1R primers. 10 μl of the reaction was loaded onto a 2.0% gel and each sample was scored as either negative (0), positive (1) or unclear (2). The results were analyzed at the Whitehead Institute database (www-genome.wi.mit.edu), which links the “vector” of 0's, 1's and 2's to the closest chromosomal marker by pairwise analysis (Gyapay et al., 1996, supra). A LOD (logarithm of odds) score of greater than 15 to the closest marker was considered as statistically significant.
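To illustrate how a pairwise LOD score may be derived from two radiation hybrid vectors, the following Python sketch uses a standard two-point model with a single retention frequency and a breakage probability θ estimated by grid search. This is a sketch under stated assumptions and is not represented to be the algorithm used by the database referred to above; the function name and the example vectors are hypothetical. Samples scored as unclear (2) in either vector are ignored.

```python
import math


def two_point_lod(vec_a: str, vec_b: str, steps: int = 1000) -> float:
    """Two-point LOD between two radiation hybrid vectors of '0', '1', '2'.

    Assumed model: equal retention frequency r for both markers and a
    breakage probability theta between them. LOD compares the best
    theta against theta = 1 (no linkage).
    """
    pairs = [(a, b) for a, b in zip(vec_a, vec_b) if a in "01" and b in "01"]
    n = len(pairs)
    if n == 0:
        raise ValueError("no informative samples")
    n11 = sum(1 for p in pairs if p == ("1", "1"))
    n00 = sum(1 for p in pairs if p == ("0", "0"))
    n_disc = n - n11 - n00                      # discordant samples (10 or 01)
    r = (2 * n11 + n_disc) / (2 * n)            # pooled retention frequency
    if r in (0.0, 1.0):
        return 0.0                              # uninformative panel

    def log10_likelihood(theta: float) -> float:
        p11 = (1 - theta) * r + theta * r * r
        p00 = (1 - theta) * (1 - r) + theta * (1 - r) ** 2
        p_disc = theta * r * (1 - r)
        total = n11 * math.log10(p11) + n00 * math.log10(p00)
        if n_disc:
            if p_disc == 0:
                return float("-inf")
            total += n_disc * math.log10(p_disc)
        return total

    best = max(log10_likelihood(i / steps) for i in range(steps + 1))
    return best - log10_likelihood(1.0)


# Hypothetical 93-sample vectors: a test marker and an identical reference marker.
test_vector = "1" * 28 + "0" * 65
reference_vector = "1" * 28 + "0" * 65
lod = two_point_lod(test_vector, reference_vector)
```

Under this model, two identical vectors of this size give a LOD well above the threshold of 15 recited above, whereas unrelated vectors give a LOD near zero.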

EXAMPLE 10 Immunofluorescence Analysis of Mouse Kidney

[0269] 10.1 Antibody Production

[0270] Polyclonal antibodies were raised in rabbits against a pGEX fusion protein containing:

[0271] (i) a fragment of human CRIM1 corresponding to the following 41 C-terminal amino acids: RVQVDSSQRMLRIAEPDARFSGFYSMQKQNHLQADNFYQTV (SEQ ID NO:8); and (ii) KVCQPGYLNILVSKASGKPGEC (SEQ ID NO:9).

[0272] The polyclonal antibodies were subsequently affinity-purified using the immunizing fragment.

[0273] 10.2 Immunofluorescence

[0274] Pregnant mice were sacrificed at 11.5 days post-coitum. Embryos were collected into ice-cold MEM medium, bisected transversely between fore and hind limbs with a scalpel, and the lower portion bisected through the spinal cord using a pair of 30 gauge needles. The metanephric mesenchyme with attached ureteric bud was removed using the same needles and collected into fresh medium. Explants were cultured in MEM/10% FCS for up to 5 days. Media was changed every 24 hrs. On the day of harvesting, explants were washed in PBS, fixed in methanol (10 min at −20° C.), washed in PBS (10 min at RT), incubated with primary antibody (1 hr at 37° C.), and washed in PBS (15 min at RT) before being mounted on coverslips and examined under an Olympus fluorescence microscope.

EXAMPLE 11 Subcellular Localization of CRIM1

[0275] 11.1 Transfection of Cells

[0276] Transfection and co-transfection reactions were performed in 6 well tissue culture plates using 50-80% confluent COS-7 cells plated the previous day. Following manufacturer's instructions, 1 μg of highly purified DNA (Nucleobond AX 500 column, Macherey-Nagel) was used per transfection reaction with a ratio of 3 μl of FuGENE 6 transfection reagent used per 1 μg DNA per well. Following transfection, cells were incubated in OPTI-MEM I reduced serum medium (Life Technologies) at 37° C. in an atmosphere supplemented with 5% CO₂ and harvested 2.5-3 days post transfection. Cell media and lysate were separately analysed for the presence of the protein(s) of interest by Western blot.

[0277] 11.2 Cell Fractionation for Subcellular Localisation: Membrane and Soluble Fractions

[0278] Cells were cultured on 10 cm plates. These were put on ice, washed three times rapidly in cold buffer A (10 mM Tris pH 7.4; 1 mM EDTA), and scraped from plates in 0.5 mL buffer A containing protease inhibitors. Cells were incubated on ice for 5 minutes before being passed through a 27 gauge needle approximately 20 times. Lysate was centrifuged at 2,000 rpm for 10 min in bench top centrifuge. Supernatant was collected and spun at 45,000 rpm at 4° C. in a bench-top ultracentrifuge for 90 minutes.

[0279] The resulting supernatant (soluble fraction) was respun at 45,000 rpm for 30 min. The pellet (membrane fractions) was resuspended in buffer A.

[0280] 11.3 Plasma Membrane, Heavy Membranes, Cytosol and Membranes

[0281] Cells were cultured on 30 cm plates, washed in ice cold HES Buffer (20 mM Hepes, 1 mM EDTA, 250 mM sucrose, pH 7.4), then scraped into 2.5 ml HES containing protease inhibitors (HES+), and then passed through a 27 gauge needle 20 times. The lysate was centrifuged at 12,000 rpm for 20 min. The supernatant was collected as the cytosol and membrane fraction.

[0282] The pellet was resuspended in 10 ml HES+ and recentrifuged. This pellet was resuspended in 0.5 mL HES+ and layered over 10 mL 1.12 M sucrose in HES in Beckman Ultra-Clear (14×89 mm) tubes for centrifugation in a Beckman SW41 rotor for 60 min at 25,000 rpm. 1 mL of solution was collected at the interface. 2 mL of HES+ was added for further centrifugation in a Beckman TLA 100.3 rotor in a bench-top ultracentrifuge. The resulting supernatant was discarded and the pellet resuspended in HES+ and snap frozen. This represents the plasma membrane fraction.

[0283] The pellet from the sucrose phase was resuspended in HES+. This represents the heavy membrane fraction, containing mitochondria, nuclei, and other heavy membranes.
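Because the centrifugation steps above are specified in rpm, reproducing them in a different rotor requires conversion to relative centrifugal force (RCF). A minimal Python sketch of the standard conversion is given below. The rotor radius is not stated in the text; the value used for the SW41 rotor is an assumption of this sketch and should be checked against the rotor manual.

```python
def rcf_from_rpm(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (x g) = 1.118e-5 * radius (cm) * rpm^2."""
    return 1.118e-5 * radius_cm * rpm ** 2


# Assumed maximum radius of the SW41 swinging-bucket rotor, in cm.
# This figure does not appear in the text and should be verified.
SW41_RMAX_CM = 15.31

sucrose_step_rcf = rcf_from_rpm(25_000, SW41_RMAX_CM)
print(f"25,000 rpm in SW41 (assumed rmax): about {sucrose_step_rcf:,.0f} x g")
```

The same function applies to the bench-top centrifuge steps once the relevant rotor radius is known.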

[0284] 11.4 Immunofluorescence Analysis

[0285] Cells were cultured in small dishes with coverslips placed in the bottom at the time of seeding with cells. For immunofluorescence, the media was removed and the cells washed with PBS twice, before fixation in 4% PFA (in PBS pH 7.4) for 15 min, washing in PBS for 2 min three times, permeabilisation in 0.1% Triton/PBS for 5 min, washing in 0.5% BSA/PBS twice for 4 minutes and blocking in 0.5% BSA/PBS for 10 min. Blocking solution was thoroughly aspirated before the addition of the primary antibody diluted in 0.5% BSA/PBS (typical dilution of 1/100-1/200).

[0286] Coverslips were incubated with primary antibody for 1 hr at room temperature, then washed three times for 2 minutes with 0.5% BSA/PBS. Slides were aspirated thoroughly before addition of the secondary antibody (typical dilution 1/200-1/400). Coverslips were incubated for 1 hr in secondary antibody, then washed with 0.5% BSA/PBS three times for two minutes each. Coverslips were then mounted on slides with Vectashield or other mounting medium.

EXAMPLE 12 Interaction Between CRIM1 and Members of the TGF-β Superfamily

[0287] 12.1 Coimmunoprecipitation With BMPs

[0288] Protein A immobilised on Sepharose 4B fast flow resin (Sigma) was equilibrated in protein A buffer (0.05 M Tris, 0.15M NaCl pH 8.5) prior to antibody being bound in a ratio of 5 μl of antibody to 40 μl of resin per immunoprecipitation reaction. Resin and antibody were incubated together for 90 minutes at room temperature on a rotating wheel, and the resin washed three times in protein A buffer prior to use.

[0289] From each transfection reaction, cell media was collected, cellular debris pelleted and removed and the media incubated with the bound resin for two hours at room temperature on a rotating wheel. The cell monolayer was gently washed with PBS, resuspended in 600 μl of lysis buffer (protein A buffer pH 7.4 containing complete protease inhibitors (Roche), 0.5% Triton X-100), transferred to 1.5 ml eppendorf tubes and incubated for one hour at 4° C. on a rocking platform. Insoluble cellular debris was pelleted, removed and the cell lysate incubated with the bound resin as described above. Following incubation, the resin was washed twice in lysis buffer followed by PBS, resuspended in 40 μl of SDS-PAGE loading buffer and boiled for 3 to 5 minutes prior to loading on a SDS-polyacrylamide gel. Immunoprecipitation reactions were analysed by Western blot, and generally using a primary antibody not used in the immunoprecipitation step. Fractions of cell lysate and cell culture media were retained prior to immunoprecipitation to allow the examination of the load material for the purposes of verification of the success of the transfections.

[0290] Ligand blotting was performed as described in Scheidegger et al., 2000, J. Biol. Chem. 275 26864, which is incorporated herein by reference.

[0291] Cell overlay technique was performed as described in Brose et al, 1999, Cell 96 795, which is incorporated herein by reference.

EXAMPLE 13 Purification and Bioassay of Recombinant CRIM1 Ectodomain

[0292] 13.1 Affinity Purification of Recombinant Ectodomain

[0293] The protocol followed for construction of an anti-myc column and purification of CRIM1 protein over the column was as described in the manufacturer's instructions provided with the Affi-Gel Hz Immunoaffinity Kit (BIO-RAD, catalog number 153-6060).

[0294] The anti-myc column was allowed to reach room temperature prior to use, then conditioned using 2 to 4 bed volumes of 0.2 M glycine-Cl pH 2.5 elution buffer. Allow the column to drain then add 5 bed volumes of application buffer (PBS, pH 7.0).

[0295] To purify CRIM1 protein expressed as CrimEcto^(myc) in COS-7 cells, the cell media was collected and centrifuged to remove cellular debris. 4 ml of media was concentrated using an Ultrafree-4 centrifugal filter unit (Millipore) to 1-1.5 ml, then a 1/10 volume of 0.5 M phosphate buffer pH 8.0 was added to increase media pH to 8.0. Cap the bottom of the column, add the concentrated media, cap the top of the column and incubate on a rotating wheel for 1-1.5 hours at room temperature. Drain the column, retaining the column run through for protein analysis, and wash the resin three times with 10 ml of PBS. The salt concentration of PBS can be increased to 0.5 M for one wash to remove contaminants if required. Elute the protein using 5 ml of 0.2 M glycine pH 2.5, and collect 750 μl aliquots in eppendorfs containing 250 μl of 0.5 M phosphate buffer pH 8.0 so as to increase the pH and hence the stability of the protein. Immediately wash the column with 20 ml of PBS and store at 4° C. in PBS containing 0.02% sodium azide.

[0296] Aliquots of the starting material, column run through and eluted fractions were analysed by Western blot and eluted fractions of interest were concentrated. Protein estimation was determined by analysing the purified, concentrated CRIM1 protein on a SDS-polyacrylamide gel alongside known concentrations of bovine serum albumin and staining the gel with Silver Stain Plus (BIO-RAD, catalog number 161-0449) (see FIG. 14A).

[0297] Each round of purification produces approximately 900 ng of protein as assessed by comparative Coomassie versus BSA standards. This recombinant protein is still recognised by the N-terminal CRIM1 antibody and can be used in functional bioassays.

[0298] 13.2 In Ovo Electroporation

[0299] Fertilised chick eggs were incubated in a forced-draft humidified incubator kept at 37.5° C. until the embryos reached Hamburger and Hamilton stage 10. The embryos were exposed by opening a small window on the eggshell and by removing the overlying vitelline membrane. Human Crim1 cDNAs (encoding CRIM1 ectodomain or CRIM1 full-length and with myc epitope) were inserted into a eukaryotic cell expression vector (pcDNA3). The expression construct DNA was injected into the lumen of the caudal neural tube by mouth pipetting. The concentration of DNA was 1-5 μg/μL and several nL of the DNA solution was injected into the spinal cord.

[0300] After injection of the DNA construct into the lumen of the neural tube, the injected embryo was placed between a pair of electrodes. A train of small voltage pulses (25V, 50 msec width and 5 times, BTX820 square pulse generator) was passed through the embryo to achieve the gene transfer by the electroporation.

[0301] After the electroporation, the embryo was rinsed with L15 culture medium and the window was closed by adhesive tape and returned to the incubator. The electroporated embryos were further incubated for another 48 hr.

[0302] At the end of incubation period, the electroporated embryos were removed from the eggshell and transferred into ice-cold phosphate buffered saline (PBS). The embryos were fixed with 4% paraformaldehyde in 10 mM PBS for 1 hour on ice, then immersed in 30% sucrose in 10 mM PBS for overnight. The embryos were further trimmed and froze in TissueTek compound and stored at −70° C. until analysis.

[0303] Frozen sections of 14 micron thickness were cut and placed on glass slides. After drying, the sections were probed with primary antibodies for cell type-specific marker proteins (anti-Engrailed1 and anti-Islet1 mouse monoclonal antibodies). The primary antibodies were detected with species-specific secondary antibodies that recognize mouse IgG and are conjugated with Cy3. The sections were also counterstained with DAPI to visualize the nuclei.

[0304] The labelled sections were observed using a compound microscope and pictures taken using a digital camera.

EXAMPLE 14 Results

[0305] 14.1. Isolation of Crim1 Nucleic Acids

[0306] Human Crim1

[0307] The human Crim1/S52 nucleic acid was initially isolated as a result of two-hybrid screening for WT1D-interacting proteins. A total of 26 large and 12 small colonies were identified by a filter-based β-galactosidase reporter assay.

[0308] “Prey” plasmids were isolated from 20 large and 10 small colonies, and retransformation experiments showed that none of the plasmids activated the HIS3 or the lacZ reporter genes on their own or with the negative control plasmid.

[0309] To determine with which portion of the LexA-WT1D fusion protein the protein products of these plasmids interacted, yeast di-hybrid analyses were performed against a panel of LexA-WT1D deletion constructs, according to the methods of Fields & Song, 1989, supra and Vojtek et al., 1993, Cell 74 205. Interaction was assessed on the basis of growth in the absence of histidine. The results of this analysis indicated that all of the proteins interacted with the N-terminus of WT1D, and none with the zinc finger motifs within the C-terminus.

[0310] Although the inserts of the isolated plasmids could be grouped according to insert size after EcoRI digestion, sequencing was performed to determine identity. All were sequenced with a pGAD10 5′ primer located immediately upstream of the 5′ end of each insert. From 25 plasmid sequences, 11 distinct clones were obtained, each of which was then sequenced from the 3′ end with the pGAD10 3′ primer.

[0311] S52 appeared to be a “false positive” identified by the two-hybrid assay in that it was present in the cDNA library oriented 3′→5′. The S52 partial cDNA obtained by two-hybrid screening was used to isolate a full-length cDNA, the nucleotide sequence of which is shown in FIG. 2 (SEQ ID NO: 5). The predicted amino acid sequence of the human CRIM1 polypeptide is shown in FIG. 1 (SEQ ID NO: 2).

[0312] A human genomic Crim1 sequence was obtained by BlastN searching of human genome project updates, which sequence is shown in FIG. 3. Two large regions of human genomic sequence (NH007814 & NH0501007) were identified which cover 236,303 bp. Sixteen of the seventeen Crim1 exons could be positioned within the genomic sequence. Exon 1 (defined as including the start ATG) resides on both PAC 28 and PAC 309 but not within the 236,303 bp region.

[0313] Mouse Crim1

[0314] A murine ortholog of human CRIM1/S52 was isolated, the murine Crim1/S52 nucleic acid shown in FIG. 2 (SEQ ID NO: 6) and deduced amino acid sequence shown in FIG. 1 (SEQ ID NO: 3). Initial identification was achieved by using the human S52 nucleic acid as the basis for a search of the EST database. This identified EST clone ID#551975 which was sequenced to reveal an apparent mouse ortholog of S52. Screening of an E11.5 mouse embryo DNA library was performed using the mouse EST clone, with positives secondarily screened with the human S52 clone. This resulted in the isolation of a cDNA which partially overlapped the existing murine EST. The remaining 5′ mouse S52 sequence was then obtained by 5′ RACE, to provide the mouse Crim1 nucleic acid shown in FIG. 2. Mouse Crim1 exhibits 84% nucleotide sequence identity to human Crim1.

[0315] Chicken Crim1

[0316] Chicken Crim1 cDNA clones were isolated by screening an embryonic day 5 chick brain cDNA library (Hargrave et al., 2000, Dev. Biol. 219 142) using a cDNA fragment which corresponds to the 3′ end of the mouse Crim1 coding region (756 bp in length, corresponding to amino acid positions between 680 and 932). From this cDNA library screening, six independent cDNA clones were isolated (clones #181.1 and #41.1 cover nucleotide positions between 0 and 1085, clones #121.3 and #5.1 cover positions between 1800 and 3150, and clones #101.1 and #131.1 cover positions between 2800 and 4010). The cDNA region that was missing from the contig of these cDNA clones was amplified using a set of oligonucleotide primers and polymerase chain reaction (PCR). Sequences of the forward and reverse primers for this PCR reaction were 5′-CTCGCTGTCCAGAAGATTCC-3′ (SEQ ID NO: 18) and 5′-GGTTGCCGCATTTGTCAGTG-3′ (SEQ ID NO: 19) respectively. This PCR product corresponds to the nucleotide positions between 1015 and 1920 of the contig of chick Crim1 homolog cDNA sequence. The chick Crim1 cDNA sequence is a total of 4010 nucleotides in length, in which nucleotide positions between 380 and 3400 correspond to the coding region (see FIG. 2).
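The clone coordinates quoted in this paragraph can be checked for coverage with a short interval calculation. The following is a minimal sketch only: the function names are hypothetical, and the positions are taken as given in the text and treated as simple (start, end) spans on the 4010-nucleotide contig.

```python
def merge_intervals(intervals):
    """Merge overlapping or touching (start, end) spans."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Extend the previous span when the new one overlaps it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def uncovered_regions(intervals, contig_start, contig_end):
    """Return the stretches of the contig not covered by any span."""
    gaps = []
    cursor = contig_start
    for start, end in merge_intervals(intervals):
        if start > cursor:
            gaps.append((cursor, start))
        cursor = max(cursor, end)
    if cursor < contig_end:
        gaps.append((cursor, contig_end))
    return gaps


# Spans quoted in the text for the three pairs of library clones.
LIBRARY_CLONES = [(0, 1085), (1800, 3150), (2800, 4010)]
# Span quoted in the text for the PCR product.
PCR_PRODUCT = (1015, 1920)

gaps_before = uncovered_regions(LIBRARY_CLONES, 0, 4010)
gaps_after = uncovered_regions(LIBRARY_CLONES + [PCR_PRODUCT], 0, 4010)
print(gaps_before)
print(gaps_after)
```

Under these assumptions the library clones alone leave positions 1085 to 1800 uncovered, and the PCR product overlaps both flanking clones so that the contig is continuous.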

[0317] The predicted protein sequence of the chick CRIM1 homolog (FIG. 1; SEQ ID NO: 4) has 82.0% overall identity to that of human CRIM1 and 82.9% overall identity to that of mouse CRIM1. The predicted chick CRIM1 homolog contains a putative signal peptide sequence at amino acid positions between 1 and 46, and an insulin-like growth factor binding protein-like motif at amino acid positions between 49 and 85. There are also six conserved cysteine-rich repeats at amino acid positions between 348 and 402, 415 and 468, 620 and 674, 691 and 746, 765 and 820, and 831 and 885, a putative transmembrane domain at amino acid positions between 948 and 973 and a short stretch of putative cytoplasmic tail.
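The feature coordinates listed for the chick CRIM1 homolog can be tabulated to derive feature lengths and to confirm that no two features overlap. This is an illustrative sketch: it assumes the quoted positions are 1-based and inclusive at both ends, and the labels CR1 to CR6 and the function names are introduced here for convenience.

```python
# Coordinates as quoted in the text for the predicted chick CRIM1 homolog.
# Assumption: positions are 1-based and inclusive at both ends.
CHICK_CRIM1_FEATURES = [
    ("signal peptide", 1, 46),
    ("IGFBP-like motif", 49, 85),
    ("CR1", 348, 402),
    ("CR2", 415, 468),
    ("CR3", 620, 674),
    ("CR4", 691, 746),
    ("CR5", 765, 820),
    ("CR6", 831, 885),
    ("transmembrane domain", 948, 973),
]


def feature_lengths(features):
    """Length in residues of each feature, counting both end positions."""
    return {name: end - start + 1 for name, start, end in features}


def overlapping_pairs(features):
    """Names of neighbouring features whose coordinate ranges overlap."""
    ordered = sorted(features, key=lambda f: f[1])
    return [
        (a[0], b[0])
        for a, b in zip(ordered, ordered[1:])
        if b[1] <= a[2]
    ]


lengths = feature_lengths(CHICK_CRIM1_FEATURES)
overlaps = overlapping_pairs(CHICK_CRIM1_FEATURES)
for name, length in lengths.items():
    print(f"{name}: {length} residues")
```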

[0318] Wholemount in situ analysis of developing chick embryos was used to investigate the expression pattern of the chicken Crim1 gene. In chick embryos, as in mouse, Crim1 mRNA is initially expressed by the floor plate and notochord, and by developing motor neurons in the ventral aspect of the neural tube. This confirms chicken Crim1/CRIM1 as a true ortholog of human and mouse Crim1/CRIM1.

[0319] 14.2. CRIM1 Polypeptide Structure

[0320] An alignment of the human, mouse and chicken Crim1 protein sequences shown in FIG. 1 (SEQ ID NOS: 2, 3, and 4 respectively) has identified several regions outside of the identified structural motifs which are unique to Crim1 proteins. Given that the human, mouse and chicken sequences represent orthologs from different organisms, we have defined an N-terminal motif PGECCPLP which is absolutely conserved between all three proteins. When this motif is used to search the GenBank non-redundant database for sequences similar to the Crim1 orthologs, the sequence identified is a partial EST clone from Caenorhabditis elegans (emb|CAA94886.1|) which encodes hypothetical protein B0024.14 (pir||T18649). This EST predicts a hypothetical protein containing the 6 cysteine-rich motifs and the transmembrane domain seen in all three Crim1 orthologs. In this protein, the region PGECCPLP has only two amino acid changes (PGNCCPPP). Searching with this sequence detects only hypothetical protein B0024.14, suggesting that the motif PGXCCPXP, together with the presence of six cysteine rich repeats spaced in the same fashion as seen in these four proteins, identifies orthologs of the CRIM1 family. Alignment of the three CRIM1/Crim1 orthologs highlighted a three amino acid conserved region consisting of RGD. This has been previously reported as a motif involved in mediation of adhesion of cells to the extracellular matrix via integrins in proteins such as latent transforming growth factor binding protein (LTBP) (Mangasser-Stephan & Gressner, 1999 297 363; Ruoslahti & Pierschbacher, 1986 44 517). The RGD motif within CRIM1 is located between the IGFBP motif and the first cysteine-rich motif.
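The consensus PGXCCPXP can be expressed as a regular expression and applied to the two motif variants quoted in this paragraph. The sketch below is illustrative only: it assumes X stands for any single residue, and the function names and the flanking residues in the toy sequence are hypothetical.

```python
import re

# PGXCCPXP written as a regular expression; X is assumed to be any residue.
CRIM1_MOTIF = re.compile(r"PG.CCP.P")


def find_motif(sequence):
    """Return (0-based offset, matched text) for each motif occurrence."""
    return [(m.start(), m.group()) for m in CRIM1_MOTIF.finditer(sequence)]


def count_differences(a, b):
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(1 for x, y in zip(a, b) if x != y)


VERTEBRATE_MOTIF = "PGECCPLP"  # human, mouse and chicken, as quoted
C_ELEGANS_MOTIF = "PGNCCPPP"   # hypothetical protein B0024.14, as quoted

# Toy sequence with made-up flanking residues, for demonstration only.
toy_sequence = "MAAA" + VERTEBRATE_MOTIF + "GGGG"

hits = find_motif(toy_sequence)
changes = count_differences(VERTEBRATE_MOTIF, C_ELEGANS_MOTIF)
print(hits, changes)
```

Both quoted variants satisfy the pattern, and they differ at two positions, in line with the two amino acid changes noted for the C. elegans sequence.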

[0321] The human and mouse CRIM1 polypeptides comprise 1036 and 1029 amino acids respectively, sharing 89% sequence identity. An alignment of human and mouse CRIM1 polypeptides and a putative C. elegans ortholog is shown in FIG. 4, and a comparative overview of polypeptide domain structure is provided in FIG. 5. The major structural features of CRIM1 include a putative signal peptide (Nielsen et al., 1997, Protein Eng. 10 1), an insulin-like growth factor binding protein (IGFBP)-like domain (Drop et al., 1992, Growth Regul. 2 69), six cysteine rich repeats (CRRs; Hunt & Barker, 1987, Biochem. Biophys. Res. Comm. 144 876) and a putative transmembrane (TM) domain identified by hydropathy plot analysis (Kyte & Doolittle, 1982, J. Mol. Biol. 157 105).

[0322] The human, mouse and chicken CRIM1 amino acid sequences therefore suggested an integral membrane protein featuring cysteine-rich repeats (CRRs) with the following pattern:—

[0323] CX₁₂₋₁₆WX₄ CX₂ CXCX₆ CX₄ CX₄₋₆ CX₉₋₁₁ CCPXC

[0324] where C=cysteine; W=tryptophan; P=proline; and X=any other amino acid.
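The spacing pattern above translates directly into a regular expression, which may be convenient for scanning candidate sequences. This is a minimal sketch, assuming that each X is any single residue; the synthetic test sequence uses alanine as an arbitrary filler and is not taken from any CRIM1 sequence.

```python
import re

# CX12-16 W X4 C X2 C X C X6 C X4 C X4-6 C X9-11 CCPXC as a regex.
# Assumption: X is treated as any single residue.
CRR_PATTERN = re.compile(
    r"C.{12,16}W.{4}C.{2}C.C.{6}C.{4}C.{4,6}C.{9,11}CCP.C"
)

# Synthetic sequence built from the shortest allowed spacings,
# with alanine (A) standing in for every X position.
synthetic_crr = (
    "C" + "A" * 12 + "W" + "A" * 4
    + "C" + "A" * 2
    + "C" + "A"
    + "C" + "A" * 6
    + "C" + "A" * 4
    + "C" + "A" * 4
    + "C" + "A" * 9
    + "CCPAC"
)

matches_pattern = CRR_PATTERN.fullmatch(synthetic_crr) is not None
cysteine_count = synthetic_crr.count("C")

# Replacing the conserved tryptophan should abolish the match.
without_trp = synthetic_crr.replace("W", "A")
matches_without_trp = CRR_PATTERN.search(without_trp) is not None

print(matches_pattern, cysteine_count, matches_without_trp)
```

The pattern fixes ten cysteines, one tryptophan and one proline, so the shortest sequence it can match is 55 residues long.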

[0325] Similar CRRs are found in the Drosophila short gastrulation (Sog) gene, Xenopus chordin (Chd) gene and the human thrombospondin gene (Francois et al., 1994, Genes & Dev, 8 2602; Sasai et al. 1994, Cell, 79 779; Dixit, et al., 1986, Proc. Natl. Acad. Sci. USA 83 5449).

[0326] 14.3. Tissue Distribution of CRIM1

[0327] Northern Analysis

[0328] Northern hybridisation of mouse embryo mRNA at different developmental stages shows that the major murine Crim1 mRNA isoform is 6.2 kb, whilst the human ortholog in adult tissues is approximately 6.0 kb (FIG. 6). There appears to be a 4.0 kb isoform expressed at lower levels in mouse embryonic tissues as well as adult human placenta. This shorter mRNA isoform could be due to splicing of the 3′ UTR, since two different ESTs which appear to be spliced isoforms have been identified in the GenBank EST database. Mouse EST 551975 has a polyA sequence which truncates the mRNA at approximately 4.0 kb of sequence. A second EST (mouse clone #13361968) is 100% identical 5′ of the polyA sequence; however, this second EST does not contain the polyA sequence at the same position as mouse EST 551975. EST 13361968 is predicted to extend much further in the 3′ direction and may represent the larger isoform seen by northern analysis.

[0329] To test for conservation of Crim1 in vertebrates, a genomic Southern blot was analyzed at high stringency with a radioactively labelled mouse S52 probe (data not shown). Positive bands were seen in human, mouse, chicken (Gallus gallus) and zebrafish (Danio rerio). This suggests homologous genomic DNA sequences are present in lower vertebrates and predicts a high level of conservation of S52 throughout evolution.

[0330] Whole Mount and Section Analysis

[0331] The whole mount analysis shown in FIG. 7 reveals that S52 is expressed in the ureteric compartment of the developing kidney from E10.5, and appears to be limited to the very tip of the ureteric tree. S52 is also expressed in the developing floor plate and motor neurons, the developing eye lens, the vibrissae and the pinna of the developing ear.

[0332] In FIG. 8, analysis of S52 expression was performed on whole embryo preparations from E8.5-E11.0, a time at which the developing neural tube forms and early patterning is thought to occur. Expression of S52 had previously been demonstrated in a small subset of cells in the neural tube including the floor plate. Therefore the expression of S52 was compared to a notochord and floor plate marker, sonic hedgehog (Shh) (Echelard et al., 1993, Cell 75 1477). Onset of S52 expression occurs between E9.0 and E10.0 (FIG. 8 B,C). There is weak expression in the somites at E9.0. Subsequently, there is expression in the somites, the anterior floor plate and developing eyes at E10.0. Expression occurs between 0.5 and 1.0 day after floor plate differentiation, as defined by Shh expression.

[0333] At E10.5 S52 expression was detected in the motor neurons, the floor plate, the somites, and the eyes. Expression in motor neurons was not detected at E10.0, a time when the first motor neurons are thought to have differentiated and migrated out of the ventricular layer. Therefore, expression of S52 appears to occur shortly after the motor neurons begin to differentiate, as revealed by expression of the early motor neuron marker, Islet-1 (Ericson et al., 1992, Science 256 1555). By E11.0, expression is detected in the somites, the eye, and very faintly in the floor plate and motor neurons.

[0334] Expression of S52 persists in older embryos from E11.5 to at least E17.5 as shown by northern hybridisation (FIG. 6). The highest level of expression is seen in E13.5 embryos. Whole mount in situ hybridisation at this stage shows expression in the lens of the eye, the ear and the vibrissae (sensory hair follicles of the snout) (FIG. 8F).

[0335] CNS Analysis

[0336] To further characterise the cell types that express mRNA in the developing CNS, embryo sections were obtained and analyzed by in situ hybridisation. Detailed morphological expression pattern analysis was performed by thin section in situ hybridisation of the cervical region of E9.5 and E10.5 embryos (FIG. 9). In addition, whole mount in situ hybridisation was performed on older embryos (E11.5-E13.5) which had been cut at the cervical level of the neural tube (FIG. 9). In all cases, analysis of expression was compared to Shh (expressed in the floor plate and notochord), and Islet-1 (Isl-1) was also used as a comparison.

[0337] Preliminary expression of S52 by whole mount and section in situ hybridisation of the brain was undertaken, and compared with that of Shh.

[0338] Whole mount in situ hybridisation suggested that onset of S52 expression in the neural tube occurs between E9.5 and E10.5. These stages were therefore chosen for section in situ hybridisation. Expression at E9.0 is very weak in the floor plate, with possibly some staining in the notochord of the cervical section of the developing neural tube (FIG. 9A). More posterior sections did not show any expression (data not shown), suggesting that S52 expression in the anterior floor plate occurs later than E9.5. By E10.5 (FIG. 9B) stronger expression was observed in the floor plate of cervical sections of the neural tube. Other sections also revealed strong floor plate expression throughout more posterior regions of the neural tube (data not shown). There is also expression in the notochord at this stage (FIG. 9B). At E10.5, expression is seen in differentiated motor neurons as determined by Islet-1 expression (FIG. 9 B,K). Interestingly, the most newly developed motor neurons in the ventral-medial neural tube (which express Islet-1) do not express S52. This may suggest that expression of S52 occurs in more mature motor neurons.

[0339] During E11.5-E13.5 expression of S52 is maintained in the motor neurons and the floor plate. By this stage, expression is also detected in more dorsal cell types (FIG. 9 C-E). Expression is first seen in a subset of cells in the dorsal neural tube (denoted as I1, FIG. 9C) at E11.5. Staining appears to be retained in these cells at E12.5 and E13.5. By E12.5 there is expression within a subset of cells in the medio-lateral neural tube (denoted as I2, FIG. 9D). By E13.5 (FIG. 9E) expression is very strong in the medio-lateral subset of cells. Although the exact types of S52 expressing cells in the more dorsal neural tube were not determined in this study, based on their positions, they are likely to be subsets of dorsal interneurons including commissural or association interneurons (Wentworth, 1984, J. Comp. Neurol. 222 96).

[0340] Preliminary expression pattern analysis of Crim1 in the anterior neural tube was undertaken. Section in situ hybridisation of transverse sections of E10.5 embryos showed strong expression in the floor plate of the midbrain (FIG. 9O). Analysis of a whole mount sagittal section of an E12.5 embryo shows expression of S52 in the floor plate of the hindbrain and the midbrain (FIG. 9R). Expression is also found in the dorsal midline of the hindbrain and the roof of the diencephalon, which gives rise to the corpus callosum. This suggests a role for S52 in development of the midbrain and the hindbrain, and possibly the diencephalon of the forebrain.

[0341] Expression Pattern of Crim1 in the Developing Urogenital Tract

[0342] In other data not shown, during kidney development, Crim1 showed expression in the ureteric tree, the early condensing mesenchyme and distal comma-shaped bodies. This considerably overlaps with the expression patterns of BMP2 and BMP7. The strong expression in pretubular aggregates and early comma-shaped bodies is identical to that of BMP2. As the nephron elongates, Crim1 becomes expressed in the proximal end of the S-shaped bodies together with BMP7 and WT1. Crim1 also displays a striking male-specific expression pattern in the fetal gonads, its expression being strongest in the Sertoli cells of the developing testis. This sexually dimorphic pattern of expression is not likely to determine sex, but may be important in normal testicular maturation, where the IGFs and members of the TGFβ superfamily are involved in testicular development.

[0343] 14.4. Chromosomal Localisation of Human CRIM1

[0344] Based on the restricted pattern of expression and the high level of conservation of CRIM1, it is likely that CRIM1 has an important role in normal embryonic development. Therefore, chromosomal localisation was performed to identify any known human diseases which localize to the same chromosomal position as CRIM1.

[0345] Radiation Hybrid Mapping of CRIM1

[0346] Crim1 was localized using the Genebridge 4 radiation hybrid DNA panel (Gyapay et al., 1996, supra). This assay depends upon being able to distinguish between human and hamster sequences at the locus of interest. Comparison of human and mouse Crim1 cDNA sequences revealed a region with a number of differences. Primers were made from this sequence (3′ PCR 1F and 1R) which amplified human cDNA and human genomic DNA, but neither mouse cDNA nor mouse genomic DNA (FIG. 10A). This allowed the human-hamster radiation hybrid panel to be assayed by amplifying PCR products only from hybrids which contained the human Crim1 locus.

[0347] Crim1 was positioned on chromosome 2 between D2S1852 (on band 2p21) and D2S1736 (on band 2p16.3) (FIG. 10B), by pairwise analysis with a high statistical significance (a maximum LOD score of greater than 16).

[0348] This location was confirmed by fluorescence in situ hybridization (FISH). Three human PAC clones containing Crim1 (PAC28; PAC309 5′ end; E2A4 3′ end) were labeled with biotin or digoxigenin and hybridized to human metaphase chromosomes, thereby revealing specific and identical fluorescent signals on both chromatids of the short arm of human chromosome 2, in the region of p21-22. This was confirmed for each of the three PAC clones in a total of 104 chromosomes. The Crim1 gene therefore localized to chromosome 2p21-16.3.

[0349] 14.5 Immunofluorescence Analysis of CRIM1

[0350] A rabbit polyclonal antibody raised against the C-terminal 41 amino acids of CRIM1 was used to examine the localization of S52 in developing kidney, as shown in FIG. 11. This analysis indicated that the predominant site of CRIM1 protein expression is in the developing and branching ureteric tree of the kidney. This is in agreement with the RNA in situ hybridization analysis of mouse kidney. Some background staining was evident in the epithelializing mesenchyme, which is likely to be derived from the Cy3-labeled secondary antibody. Labeling is much stronger in the ureteric tree, especially at the growing tips.

[0351] 14.6 Biochemical Characterisation and Subcellular Localisation of CRIM1

[0352] Using the antibodies previously described to both the N- and C-terminal ends of the CRIM1 protein and the use of myc epitope tagging, the present inventors have been able to examine characteristics of the CRIM1 protein by transfection of Crim1-containing expression constructs into cell lines.

[0353] Mammalian expression constructs were created from human CRIM1 using the pcDNA3 vector which employs a CMV promoter. These contained either the full length CRIM1 protein with an N-terminal myc epitope tag, full length with a C-terminal myc epitope tag or an ectodomain construct which does not include the transmembrane domain or C-terminal end of the protein. This ectodomain construct had a myc epitope tag at its C-terminal end, just past CR6 (see FIG. 12A).

[0354] The present inventors have also established via Western blot analysis of cell fractions and immunofluorescence of transfected cells that the full length CRIM1 protein is targeted for insertion into the plasma membrane and that it acts as a Type 1 transmembrane protein with the N-terminal end facing the media and the cytoplasmic tail facing the cytoplasm. In Western analyses of cellular fractions, full length CRIM1 protein is predominantly located within the light membrane fraction, which predominantly includes the plasma membrane, with some protein present in the heavy membrane fractions, which include the endoplasmic reticulum (ER; FIG. 12B). Immunofluorescence of permeabilised cells with antibodies to either end of full length CRIM1 reveals a staining pattern consistent with the ER, suggesting that trafficking is occurring (FIG. 12 E,F&G). Immunofluorescence of non-permeabilised cells reveals uniform staining of the Crim1 protein all over the cell surface using the N-terminal antibody (FIG. 12H). In cells transfected with the ectodomain construct, the protein was detectable within the ER of permeabilised cells using an anti-myc epitope antibody, but was not seen on non-permeabilised cells, suggesting that the secreted ectodomain protein is not attaching to the cell surface (FIGS. 12 I&J). Hence, the putative transmembrane domain does insert into the plasma membrane with the potential growth factor binding sites facing the location of secreted growth factors (Type 1 transmembrane protein).

[0355] As assessed by Western analysis after cell culture in FCS-free OPTIMEM media, full length CRIM1 protein does not appear to be secreted into the cell culture media. In contrast, an ectodomain construct of CRIM1, which does not contain the transmembrane domain or the C-terminal region, is freely secreted into the cell culture media (FIGS. 12 C, I & J). This secreted protein does not associate with the outside surface of the cell, as evidenced by a lack of visible immunofluorescence of cells transfected with the ectodomain construct as detected by an anti-myc epitope antibody (FIG. 12I).

[0356] The size of the ectodomain protein within the secretory pathway inside the cell appears smaller than once it has been secreted into the media (see arrows, FIG. 12C). To investigate further whether this resulted from glycosylation of the protein, protein harvested from cell culture media was treated with N-Glycosidase F (Boehringer Mannheim). This reduced the size of the protein (by 10-15 kDa), suggesting the existence of N-linked glycosylation sites (FIG. 12D). An analysis of the sequence revealed three potential N-linked glycosylation sites: NES at the end of the IGFBP motif, NPT within CR3 and NNS between CR4 and CR5. This indicates that the protein is decorated with carbohydrates.
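The three tripeptides named in this paragraph all fit the N-X-S/T form commonly used to flag potential N-linked glycosylation sites. A minimal sketch of such a scan follows; the function name and the toy sequence are hypothetical, and the scan says nothing about whether a site is actually occupied. A stricter convention that excludes proline at the middle position is offered as an option, under which NPT would not be reported.

```python
import re


def find_sequons(sequence, exclude_proline=False):
    """Return (0-based offset, tripeptide) for each N-X-S/T occurrence.

    With exclude_proline=True, tripeptides with proline at the middle
    position are skipped. A lookahead is used so that overlapping
    candidates are all reported.
    """
    middle = "[^P]" if exclude_proline else "."
    pattern = re.compile(rf"(?=(N{middle}[ST]))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence)]


# Toy sequence: the three quoted tripeptides separated by alanines.
toy_sequence = "AANESAANPTAANNSAA"

relaxed_sites = find_sequons(toy_sequence)
strict_sites = find_sequons(toy_sequence, exclude_proline=True)
print(relaxed_sites)
print(strict_sites)
```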

[0357] Anti-CRIM1 antibodies were used to investigate the size of endogenous CRIM1 protein to look for evidence of proteolytic cleavage or multiple isoforms. Aqueous and vitreous humor were used as a potential source of CRIM1 protein due to the high expression of the Crim1 gene within the epithelial cells of the lens and the abundance of TGFβ superfamily and insulin-like growth factors within the humors. A protein of approximately 150 kD was detected in both aqueous and vitreous humor using antibodies to both the N- and C-terminal ends of CRIM1. This suggests that processing of CRIM1 into a soluble form can occur in vivo (see FIG. 12D).

[0358] 14.7 Characterisation of Interactions Between CRIM1 and Members of the TGFβ Superfamily

[0359] To directly investigate the ability of CRIM1 protein to interact with members of the TGFβ superfamily, myc-tagged BMP4 and Crim1 constructs were co-transfected into COS cell lines and immunoprecipitation studies were performed under non-denaturing conditions using the N-terminal CRIM1 antibody. This was suboptimal as this antibody prefers to recognise denatured protein. However, these studies showed that the BMP4 preprotein within the cell lysate was immunoprecipitated together with CRIM1. Only occasional weak evidence existed to show that the secreted processed form of BMP4 was binding CRIM1 ectodomain. This may mean that CRIM1 only interacts with the preprotein or that the levels of BMP4 and CRIM1 present in the media were too low to detect an interaction. However, this is preliminary evidence that CRIM1 does interact with BMP4 (see FIG. 13A).

[0360] Co-transfections were also examined before immunoprecipitation. The presence of CRIM1 did not increase the degree of processing and secretion of either BMP4 or BMP7 (see FIG. 13B). However, the presence of either BMP4 or BMP7 within the cell resulted in the secretion of the full length CRIM1 protein (see FIG. 13C). Hence, either directly or indirectly, BMP can elicit the secretion of a CRIM1-derived protein of a similar size to that detected in aqueous humour. It is not clear how this occurs. However, this ability of BMPs to elicit processing of CRIM1 is further evidence for a functional interaction between the BMPs and CRIM1.

[0361] Ligand blotting was used to see if the CRIM1-like protein within aqueous humour could interact with recombinant TGFβ2. Aqueous humour was electrophoresed under non-denaturing conditions and transferred to PVDF membranes. Membranes were then incubated with or without recombinant TGFβ2, before being bound with TGFβ2 antibodies. A band equivalent to the CRIM1-like protein was detected with the TGFβ2 antibody, suggesting that the TGFβ2 does interact with this protein. The same band was detected in vitreous humour (see FIG. 13D).

[0362] Cell overlay assays were also used to investigate whether recombinant BMP4 could bind to CRIM1 expressing cells. COS cells were transfected with full length N-terminal myc-tagged Crim1 constructs. Recombinant protein was added to these cells in culture before analysis via immunofluorescence using antibodies to BMP4. CRIM1-expressing cells did stain positive with the BMP4 antibody only when the recombinant protein had been added to the cultures. This pattern was not seen in untransfected cells, indicating that BMP4 was binding to cells expressing full length CRIM1 protein but not to non-expressing cells (see FIG. 13E).

[0363] 14.8 Purification and Bioassay of Recombinant CRIM1 Ectodomain

[0364] For the purposes of functional assays, a subclone of Crim1 encoding only the N-terminal extracellular portion of the protein (amino acids 1-901) including an N-terminal myc epitope tag was generated as a mammalian expression construct. This ectodomain construct contains all 6 cysteine-rich motifs plus the IGFBP motif, but not the putative transmembrane domain or the cytoplasmic tail. The construct was transiently transfected into COS7 cells. Western blot analysis of cell lysates versus cell culture media confirmed that this fragment is freely secreted. A stable transformant producing CRIM1 ectodomain has been produced, although the level of protein production is lower than after transient transfection.

[0365] Purification of myc-tagged CRIM1 ectodomain protein from transiently transfected cells was performed using an anti-myc epitope monoclonal antibody affigel column as described hereinbefore (see FIG. 14A).

[0366] In the developing neural tube, dorsal cell type differentiation appears to be controlled, at least in part, by BMPs and other members of the TGF-β superfamily which are secreted by the roof plate, while differentiation of motor neurons in the ventral spinal cord is inhibited by these dorsally-derived factors.

[0367] A possible biological activity for the CRIM1 protein was examined by ectopically expressing the full length or ectodomain CRIM1 proteins in embryonic chick spinal cord using the in ovo electroporation technique. Such spinal cord electroporations have resulted in disruption to the migration of neural crest cells from the spinal cord to form the dorsal root ganglia. Ectopic expression of CRIM1 in the dorsal neural tube in the chick embryo also appears to repress the determination of dorsal interneurons, as assessed by the number of engrailed-1 (En-1) expressing cells, and ventral motor neurons, as assessed by the number of Islet-1 (Isl-1) expressing cells (see FIG. 14B). These results suggest a role for CRIM1 in neural tube and neural crest development in vertebrates.

EXAMPLE 15 Discussion

[0368] CRIM1 Structure

[0369] The human, mouse and chicken Crim1 nucleic acids of the invention are predicted to encode a relatively large transmembrane protein of approximately 120 kD, with several highly conserved domains shared with other known proteins. Three lines of evidence suggest that this protein is a single pass integral membrane protein with a large extracellular domain. Firstly, there is a hydrophobic stretch of amino acids at the N-terminus, which is predicted to be a signal peptide (Nielsen et al., 1997a, supra).

[0370] Secondly, there is a transmembrane domain toward the C-terminus predicted by hydropathy plot analysis. Finally, the N-terminal domain contains six cysteine rich repeat (CRR) motifs, an RGD motif and an IGFBP-like motif, which are found within secreted proteins or within the extracellular domain of integral membrane proteins (Shimasaki & Ling, 1991, Prog. Growth. Factor Res. 3 243; Francois et al., 1994, Genes Dev. 8 2602). There is a high level of conservation of the N-terminal domain in the human and mouse CRIM1 sequences, and the C. elegans homolog. This suggests that CRIM1 is important in processes involving extracellular protein-protein interactions.

[0371] Cysteine rich repeats (CRRs) are motifs (58-70 amino acids) containing 10 conserved cysteines in a defined spacing pattern. These motifs have been found in a variety of proteins including the secreted precursor of type II collagen (Ryan & Sandell, 1990, J. Biol. Chem. 265 10334), the adhesion factors thrombospondin (Dixit et al., 1986, supra) and Von Willebrand Factor (Hunt & Barker, 1987, supra), NEL (Atsuhashi et al., 1995, supra), NEL-like proteins (NELL1 and NELL2; Watanabe et al., 1996, Genomics 38 273), and the cell patterning proteins, chordin (Sasai et al., 1994, supra; Streit et al., 1998, Dev. Suppl. 125 507) and short gastrulation (Francois et al., 1994, supra). The CRR in thrombospondin corresponds to a region which can bind to collagen (Dixit et al., 1986, supra), whilst in procollagen it appears important for oligomerization of mature collagen (Ryan & Sandell, 1990, supra). Furthermore, the CRRs in Sog and Chd appear important in binding to members of the TGF-β family (Piccolo et al., 1996, Cell 86 589). Therefore, these motifs in CRIM1 may be important in protein-protein interactions with other proteins found in the extracellular space.

[0372] At the predicted N-terminus of the mature CRIM1 protein is a domain which is found in a variety of proteins and which is characterized by the presence of an insulin-like growth factor binding protein (IGFBP)-like motif. This motif, which includes 8 highly conserved cysteines (Kim et al., 1997, Proc. Natl. Acad. Sci USA 94 12981), has been found in all six known IGFBPs (Shimasaki & Ling, 1991, supra). Furthermore, this domain has been found in the secreted proteins MAC25 (Murphy et al., 1993, Cell Growth Differ. 4 715), NOV (Martinerie et al., 1992, Oncogene 7 2529), connective tissue growth factor (CTGF) (Bradham et al., 1991, J. Cell Biol. 114 1285), CYR61 (O'Brien et al., 1990, Mol. Cell. Biol. 10 3569) and Drosophila twisted gastrulation (TSG) (Mason et al., 1994, Genes Dev. 8 1489). Mutation and structural analysis has revealed that this domain may fold as a separate tertiary structure through disulfide bonds (Drop et al., 1992, supra; Forbes et al., 1998, J. Biol. Chem. 273 4647). Furthermore, in vitro studies have shown that this domain is important for binding to growth factors such as IGF and insulin (Oh et al., 1996, J. Biol. Chem. 271 30322; Kim et al., 1997, supra). This suggests that CRIM1 may be actively involved in binding to and mediating the activity of certain growth factors in the extracellular space.

[0373] A remarkable outcome of this study was the striking conservation of CRIM1/Crim1 throughout evolution. An uncharacterized putative ortholog in C. elegans (B0024.4; Wilson et al., 1994, Nature 368 32) was identified by BlastP searching and demonstrated significant similarity to the human and mouse Crim1 polypeptide sequences. The C. elegans polypeptide has a very similar domain structure to the human and mouse CRIM1 polypeptides, but appears to be truncated at the N-terminus, containing neither a signal peptide nor an IGFBP domain. However, the C. elegans ORF was predicted by a combination of genomic sequencing and overlapping EST clones, and may not encode a full length protein sequence (Wilson et al., 1994, supra). In all three species, 95% of cysteine residues were found to be conserved in the extracellular domain, suggesting that these homologs and orthologs have very similar structures.
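A percentage of conserved cysteines such as that stated above can be derived from a multiple sequence alignment by counting the alignment columns in which every sequence carries a cysteine. The following Python sketch is an illustration under stated assumptions, not the procedure used by the inventors: the function name `cysteine_conservation` is hypothetical, the input is assumed to be pre-aligned one-letter sequences of equal length with "-" for gaps, and the example sequences are invented toy strings rather than CRIM1 data.

```python
def cysteine_conservation(aligned, ref_index=0):
    """Fraction of cysteine columns in the reference sequence that are
    cysteine in every aligned sequence.

    `aligned` is a list of equal-length, pre-aligned one-letter sequences
    ('-' marks a gap). The choice of reference sequence is an assumption
    made for this sketch.
    """
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must have equal length")
    seqs = [s.upper() for s in aligned]
    ref = seqs[ref_index]
    # Columns where the reference has a cysteine.
    cys_cols = [i for i, aa in enumerate(ref) if aa == "C"]
    if not cys_cols:
        return 0.0
    # A column is conserved only if all sequences have 'C' there.
    conserved = sum(all(s[i] == "C" for s in seqs) for i in cys_cols)
    return conserved / len(cys_cols)


if __name__ == "__main__":
    # Hypothetical toy alignment of three short sequences.
    toy_alignment = ["ACDCEC", "ACDCES", "ACD-EC"]
    print(cysteine_conservation(toy_alignment))
```

Restricting the input to the aligned extracellular region would be needed to mirror the comparison described in the text.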

[0374] The presence of Crim1 genes in chicken (Gallus gallus) and zebrafish (Danio rerio) predicts that highly homologous proteins are present in a variety of vertebrate species. Due to this conservation, it is likely that Crim1/CRIM1 and its homologues have important and conserved roles in both vertebrates and invertebrates.

[0375] The expression pattern of Crim1, together with sequence data, suggests that this gene may be involved in dorsal-ventral patterning of the CNS by regulating BMP-mediated signalling. BMP proteins are expressed widely throughout the developing central nervous system. Antagonism of BMP proteins by secreted factors such as Noggin, Chordin and Follistatin has been shown to be an important conserved mechanism of patterning throughout vertebrate development (Holley et al., 1995, Nature 376 249; Sasai et al., 1995, supra; Zimmerman et al., 1996, Cell 86 599; Fainsod et al., 1997, Mech. Dev. 63 39).

[0376] A gradient of signalling across the newly developed neural tube is thought to establish a basic dorsal-ventral polarity, determining the fates of neurons and glia by the dorsal-ventral position of their precursor cells in the neural tube (Tanabe & Jessell, 1996, supra). This patterning is thought to be controlled by Shh from the notochord and floor plate, and BMP proteins (BMP4, BMP5, BMP7 and DSL-1) from the epidermal ectoderm and roof plate (Roelink et al., 1994, Cell 76 761; Liem et al., 1995, Cell 82 969; Roelink et al., 1995, Cell 81 445; Liem et al., 1997, Cell 91 127).

[0377] A key structural feature of CRIM1 is the multiple CRRs. Of the proteins which contain CRRs, Drosophila Sog and Xenopus chordin are of most interest because their overall structural organisation is similar to that of CRIM1. Both Chd and Sog gene products are thought to bind to TGF-β superfamily members and antagonize intercellular signalling (Sasai et al., 1994, supra; Piccolo et al., 1996, supra; Yu et al., 1996, supra). In vitro binding assays have shown that Chd protein can bind BMP4 with high affinity (Piccolo et al., 1996, supra). In Xenopus, direct inhibition of BMP4 by Chd (secreted from the organiser) has been shown to act as a dorsalising factor during gastrulation, and result in formation of dorsal structures such as neural tissue (Sasai et al., 1994, supra; Piccolo et al., 1996, supra). In Drosophila, Sog has been shown by genetic epistasis to inhibit the signalling of decapentaplegic (a TGF-β superfamily member which is very similar to Xenopus BMP4), resulting in ventralisation of the gastrulating fly embryo (and induction of neural tissue) (Francois et al., 1994, supra). Since overexpression of Sog in Xenopus can rescue null mutants of Chd, it has been postulated that Sog and Chd encode proteins which are functionally equivalent (Schmidt et al., 1995a, Development 121 4319).

[0378] However, there is significant homology between the two proteins only within the CRRs (Francois & Bier, 1995, Cell 80 19). Although the exact structural and biochemical properties of CRRs are largely unknown, it is likely that the CRRs in Sog and Chd are important for binding to TGF-β molecules.

[0379] CRIM1 and the Eye

[0380] CRIM1 is strongly expressed in the developing lens and may play a role in maintaining this layer of cells as an epithelium. Hence, CRIM1 may act in an anti-cataractogenic fashion. Significant evidence exists to suggest that within the eye, TGFβ, particularly TGFβ2, can act in a cataractogenic fashion, leading to disruption in the morphology of the epithelial cells covering the front of the lens. The result of addition of recombinant TGFβ2 to lens explant cultures is the production of plaques identical in histology to those seen arising after surgery for the removal of cataracts. The suggestion that inhibitors of TGFβ will act in an anti-cataractogenic fashion is supported by the disclosures of International Publication WO95/13827 and International Application WO98/26784, each of which is incorporated herein by reference.

[0381] As described herein, CRIM1 is capable of interacting with TGFβ superfamily members such as TGFβ2 and BMP4. This strongly suggests that CRIM1 is involved in regulating the activities of TGFβ superfamily members and may therefore have therapeutic value in cataractogenesis (via interaction with TGFβ2) and bone formation and remodelling (via interaction with BMPs).

[0382] CRIM1 and Bone Development

[0383] Members of the TGFβ superfamily, including the bone morphogenetic proteins (BMPs), are expressed during, and are critical for, early embryo development and organogenesis of a variety of organs, including the kidney, eye (cornea and lens), heart, skeleton, tooth, limb and central nervous system. By interacting with members of the BMP, GDF, activin or TGFβ families, CRIM1 may have a role in development, remodelling or repair of these various organs. More specifically, CRIM1 may be able to modulate activities such as bone remodelling, tissue regeneration and motor neuron specification. The genes may be useful in the diagnosis of diseases of these organs, and the proteins, or the genes encoding them in some form of gene therapy construct, may be useful in the treatment of such conditions. Several of the BMPs have the property of generating bone, including BMP2 and BMP7 (OP-1), and OP-1 is already known to have potential in bone remodelling, including the treatment of periodontal and orthopedic indications such as fractures.

[0384] CRIM1 and Kidney/Gonad Development

[0385] The expression pattern of Crim1 in the kidney and gonads overlaps with the expression patterns of BMP2 and BMP7. Several of the BMPs are expressed strongly during the development of the kidney, including BMP2, 4, 5 and 7, as for example reviewed in Godin et al., 1999, supra. More particularly, an important role for BMP7 has been shown in the formation of the kidney and in defects including renal dysgenesis, cystic kidneys or agenesis.

[0386] BMP7 expression in the kidney continues after birth, as does expression of receptors for BMP on the podocytes within the glomeruli. Application of BMP7 has been shown to decrease the loss of kidney function associated with acute ischaemic injury (Vukicevic et al., 1998, J. Clin. Invest. 102 202). It can also inhibit tubulointerstitial fibrosis and inflammation after unilateral ureteral obstruction. CRIM1 may similarly assist in such conditions by facilitating or increasing the duration of such BMP7 activity. BMP7 (OP-1) has been found to be preventative for renal fibrosis associated with ureteral obstruction. OP-1 administration can prevent tubular atrophy and diminish the activation of tubulointerstitial inflammation and fibrosis, thereby preserving renal function (Hruska et al., 2000, Am. J. Renal. Physiol. 279 130).

[0387] Within the kidney, TGFβ has been implicated in vascular remodelling, premature termination of normal nephrogenesis, promotion of a transition of epithelial cells to mesenchymal cells and a variety of other effects. Increases in circulating TGFβ1 occur during diabetes. This may contribute to the onset of diabetic nephropathy via the induction of collagens 3 and 1 which result in scarring and fibrosis within the kidney.

[0388] Therefore, through interactions with TGFβ and/or BMPs such as OP-1, CRIM1 may regulate kidney and gonad development (such as testicular maturation), and thereby constitute a therapeutic candidate for renal and gonadal diseases such as described above.

[0389] CRIM1 and Neuronal Development

[0390] The spatial and temporal control of neuron differentiation by adjacent cells along the dorsal-ventral axis is important even after initial cell patterning by Shh and BMPs. Whilst early patterning by Shh and BMP proteins was thought to be sufficient to specify the cellular subtypes along the dorsal-ventral axis, various in vivo experiments have shown that other signalling factors must also be involved at later stages of dorsal-ventral patterning. The control of differentiation of ventral interneurons which express the homeodomain-containing gene, Engrailed-1 (En-1), was shown to be dependent on the concentration of Shh-N in vitro (Ericson et al., 1996, supra). However, this class of ventral interneuron was missing in the mouse in which more ventrally located motor neurons are eliminated by the targeted inactivation of Islet-1, a gene essential for generation of motor neurons (Pfaff et al., 1996, Cell 84 309). It is possible that other intercellular signalling molecules expressed by motor neurons are involved in the process of En-1 interneuron differentiation. Such factors may affect dorsal-ventral patterning by interacting with either Shh or BMPs, and regulating the local concentration of these signalling molecules.

[0391] CRIM1 may be actively involved in patterning neighbouring cells after the initial dorsal-ventral pattern has been established by Shh and BMPs. Expression of S52 mRNA in the motor neurons appears at the right time (E10.5) to influence differentiation of interneurons expressing En-1 (Pfaff et al., 1996, supra). CRIM1 could conceivably achieve this by inhibition of BMP proteins or other TGF-β signalling molecules, which are expressed in the dorsal neural tube (Liem et al., 1995, supra; Liem et al., 1997, supra). By inhibiting BMP proteins, CRIM1 could allow differentiation of these ventral interneurons. Expression of CRIM1 in other populations of neurons such as dorsal interneurons may have similar functions in cell patterning, allowing specification of adjacent cell types.

[0392] Another function of signalling molecules expressed by the floor plate is the control of axon guidance. The secreted signalling molecule NETRIN-1, and various cell surface proteins, such as those of the immunoglobulin superfamily, are expressed by the floor plate and are known to be involved in axon guidance in the central and peripheral nervous system (Tessier-Lavigne & Goodman, 1996, supra).

[0393] The results of expression pattern analysis presented in this study suggest that CRIM1 may be involved in axon guidance. Firstly, onset of expression in the floor plate occurs around E9.5-E10.5 in mouse (FIG. 9), a time when commissural neurons are thought to form and begin sending axons toward the floor plate (Colamarino & Tessier-Lavigne, 1995, Cell 81 621). Secondly, expression occurs in distinct subsets of neurons along the lateral neural tube during early development, defining a pathway by which axons of commissural, association, and sensory neurons move throughout the lateral neural tube (Wentworth, 1984, supra). Since Crim1 appears to encode a transmembrane-bound protein which could be involved in extracellular interactions, this molecule may be involved in cell-cell contact dependent guidance of axons in the neural tube.

[0394] Embryonic expression of the Crim1 gene occurs in the notochord and floor plate, which are known to be the source of the embryonic organising centre for the developing central nervous system (CNS). The nucleotide sequence, protein sequence and conservation of the CRIM1 homologues in human, mouse, chick and C. elegans predict an essential role for CRIM1 that has been conserved during animal evolution. These findings suggest that CRIM1 functions as part of a signalling mechanism which is required for normal CNS development. This may also be important in tissue regeneration including kidney replacement therapy.

[0395] CRIM1 protein, nucleic acids encoding said protein, and interacting proteins which selectively bind such proteins may function as regulators of normal neuronal differentiation in the spinal cord, and of migration of neural crest-derived cells, by either direct or indirect interactions with other growth factors such as BMPs, TGFβs and IGFs that are thought to be involved in normal and/or abnormal neuronal differentiation in the mammalian CNS. CRIM1 may also function as a neural cell adhesion molecule that is required for the normal development and maintenance of neurons in the CNS during normal embryonic development and in the adult. CRIM1 may also promote development of neuronal processes such as axons in the developing CNS.

[0396] CRIM1 protein, or nucleic acids encoding CRIM1 and interacting proteins which selectively bind such proteins will also find use in screening chemical libraries for regulators of neural differentiation, cell migration, adhesion and neuronal process growth, in genetic mapping, as probes for related genes, as diagnostic reagents for genetic neurological disease and in the production of specific-cellular and animal systems for the development of neurological disease therapy, particularly for conditions such as motor neuron disease. They may also be important in the derivation and in vitro culture of neural stem cells for stem cell therapy of neurological conditions.

[0397] Conclusion

[0398] In light of the foregoing, it will be appreciated that the human Crim1 nucleic acid, the Crim1 genomic sequence and the CRIM1 polypeptide isolated by the present inventors, may provide new and useful diagnostic and therapeutic tools applicable to one or more of a variety of diseases such as those described above. The Crim1 nucleic acids of the invention are also useful in chromosomal mapping studies and analysis of tissue type and tissue and organ development.

[0399] Throughout this specification the aim has been to describe the preferred embodiments of the invention without limiting the invention to any one embodiment or specific collection of features. It will therefore be appreciated by those of skill in the art that, in light of the instant disclosure, various modifications and changes can be made in the particular embodiments exemplified without departing from the spirit and scope of the present invention.

[0400] It will also be appreciated that all patent and scientific literature and computer programs described in this specification are incorporated in their entirety herein by reference.

1 24 1 8 PRT Artificial Consensus Amino Acid Sequence 1 Pro Gly Glu Cys Cys Pro Leu Pro 1 5 2 1036 PRT Homo sapiens 2 Met Tyr Leu Val Ala Gly Asp Arg Gly Leu Ala Gly Cys Gly His Leu 1 5 10 15 Leu Val Ser Leu Leu Gly Leu Leu Leu Leu Leu Ala Arg Ser Gly Thr 20 25 30 Arg Ala Leu Val Cys Leu Pro Cys Asp Glu Ser Lys Cys Glu Glu Pro 35 40 45 Arg Asn Cys Pro Gly Ser Ile Val Gln Gly Val Cys Gly Cys Cys Tyr 50 55 60 Thr Cys Ala Ser Gln Arg Asn Glu Ser Cys Gly Gly Thr Phe Gly Ile 65 70 75 80 Tyr Gly Thr Cys Asp Arg Gly Leu Arg Cys Val Ile Arg Pro Pro Leu 85 90 95 Asn Gly Asp Ser Leu Thr Glu Tyr Glu Ala Gly Val Cys Glu Asp Glu 100 105 110 Asn Trp Thr Asp Asp Gln Leu Leu Gly Phe Lys Pro Cys Asn Glu Asn 115 120 125 Leu Ile Ala Gly Cys Asn Ile Ile Asn Gly Lys Cys Glu Cys Asn Thr 130 135 140 Ile Arg Thr Cys Ser Asn Pro Phe Glu Phe Pro Ser Gln Asp Met Cys 145 150 155 160 Leu Ser Ala Leu Lys Arg Ile Glu Glu Glu Lys Pro Asp Cys Ser Lys 165 170 175 Ala Arg Cys Glu Val Gln Phe Ser Pro Arg Cys Pro Glu Asp Ser Val 180 185 190 Leu Ile Glu Gly Tyr Ala Pro Pro Gly Glu Cys Cys Pro Leu Pro Ser 195 200 205 Arg Cys Val Cys Asn Pro Ala Gly Cys Leu Arg Lys Val Cys Gln Pro 210 215 220 Gly Asn Leu Asn Ile Leu Val Ser Lys Ala Ser Gly Lys Pro Gly Glu 225 230 235 240 Cys Cys Asp Leu Tyr Glu Cys Lys Pro Val Phe Gly Val Asp Cys Arg 245 250 255 Thr Val Glu Cys Pro Pro Val Gln Gln Thr Ala Cys Pro Pro Asp Ser 260 265 270 Tyr Glu Thr Gln Val Arg Leu Thr Ala Asp Gly Cys Cys Thr Leu Pro 275 280 285 Thr Arg Cys Glu Cys Leu Ser Gly Leu Cys Gly Phe Pro Val Cys Glu 290 295 300 Val Gly Ser Thr Pro Arg Ile Val Ser Arg Gly Asp Gly Thr Pro Gly 305 310 315 320 Lys Cys Cys Asp Val Phe Glu Cys Val Asn Asp Thr Lys Pro Ala Cys 325 330 335 Val Phe Asn Asn Val Glu Tyr Tyr Asp Gly Asp Met Phe Arg Met Asp 340 345 350 Asn Cys Arg Phe Cys Arg Cys Gln Gly Gly Val Ala Ile Cys Phe Thr 355 360 365 Ala Gln Cys Gly Glu Ile Asn Cys Glu Arg Tyr Tyr Val Pro Glu Gly 370 375 380 Glu Cys Cys Pro Val Cys Glu Asp Pro Val Tyr Pro Phe Asn Asn 
Pro 385 390 395 400 Ala Gly Cys Tyr Ala Asn Gly Leu Ile Leu Ala His Gly Asp Arg Trp 405 410 415 Arg Glu Asp Asp Cys Thr Phe Cys Gln Cys Val Asn Gly Glu Arg His 420 425 430 Cys Val Ala Thr Val Cys Gly Gln Thr Cys Thr Asn Pro Val Lys Val 435 440 445 Pro Gly Glu Cys Cys Pro Val Cys Glu Glu Pro Thr Ile Ile Thr Val 450 455 460 Asp Pro Pro Ala Cys Gly Glu Leu Ser Asn Cys Thr Leu Thr Gly Lys 465 470 475 480 Asp Cys Ile Asn Gly Phe Lys Arg Asp His Asn Gly Cys Arg Thr Cys 485 490 495 Gln Cys Ile Asn Thr Glu Glu Leu Cys Ser Glu Arg Lys Gln Gly Cys 500 505 510 Thr Leu Asn Cys Pro Phe Gly Phe Leu Thr Asp Ala Gln Asn Cys Glu 515 520 525 Ile Cys Glu Cys Arg Pro Arg Pro Lys Lys Cys Arg Pro Ile Ile Cys 530 535 540 Asp Lys Tyr Cys Pro Leu Gly Leu Leu Lys Asn Lys His Gly Cys Asp 545 550 555 560 Ile Cys Arg Cys Lys Lys Cys Pro Glu Leu Ser Cys Ser Lys Ile Cys 565 570 575 Pro Leu Gly Phe Gln Gln Asp Ser His Gly Cys Leu Ile Cys Lys Cys 580 585 590 Arg Glu Ala Ser Ala Ser Ala Gly Pro Pro Ile Leu Ser Gly Thr Cys 595 600 605 Leu Thr Val Asp Gly His His His Lys Asn Glu Glu Ser Trp His Asp 610 615 620 Gly Cys Arg Glu Cys Tyr Cys Leu Asn Gly Arg Glu Met Cys Ala Leu 625 630 635 640 Ile Thr Cys Pro Val Pro Ala Cys Gly Asn Pro Thr Ile His Pro Gly 645 650 655 Gln Cys Cys Pro Ser Cys Ala Asp Asp Phe Val Val Gln Lys Pro Glu 660 665 670 Leu Ser Thr Pro Ser Ile Cys His Ala Pro Gly Gly Glu Tyr Phe Val 675 680 685 Glu Gly Glu Thr Trp Asn Ile Asp Ser Cys Thr Gln Cys Thr Cys His 690 695 700 Ser Gly Arg Val Leu Cys Glu Thr Glu Val Cys Pro Pro Leu Leu Cys 705 710 715 720 Gln Asn Pro Ser Arg Thr Gln Asp Ser Cys Cys Pro Gln Cys Thr Asp 725 730 735 Gln Pro Phe Arg Pro Ser Leu Ser Arg Asn Asn Ser Val Pro Asn Tyr 740 745 750 Cys Lys Asn Asp Glu Gly Asp Ile Phe Leu Ala Ala Glu Ser Trp Lys 755 760 765 Pro Asp Val Cys Thr Ser Cys Ile Cys Ile Asp Ser Val Ile Ser Cys 770 775 780 Phe Ser Glu Ser Cys Pro Ser Val Ser Cys Glu Arg Pro Val Leu Arg 785 790 795 800 Lys Gly Gln Cys Cys Pro Tyr Cys Ile Glu Asp Thr Ile Pro Lys 
Lys 805 810 815 Val Val Cys His Phe Ser Gly Lys Ala Tyr Ala Asp Glu Glu Arg Trp 820 825 830 Asp Leu Asp Ser Cys Thr His Cys Tyr Cys Leu Gln Gly Gln Thr Leu 835 840 845 Cys Ser Thr Val Ser Cys Pro Pro Leu Pro Cys Val Glu Pro Ile Asn 850 855 860 Val Glu Gly Ser Cys Cys Pro Met Cys Pro Glu Met Tyr Val Pro Glu 865 870 875 880 Pro Thr Asn Ile Pro Ile Glu Lys Thr Asn His Arg Gly Glu Val Asp 885 890 895 Leu Glu Val Pro Leu Trp Pro Thr Pro Ser Glu Asn Asp Ile Val His 900 905 910 Leu Pro Arg Asp Met Gly His Leu Gln Val Asp Tyr Arg Asp Asn Arg 915 920 925 Leu His Pro Ser Glu Asp Ser Ser Leu Asp Ser Ile Ala Ser Val Val 930 935 940 Val Pro Ile Ile Ile Cys Leu Ser Ile Ile Ile Ala Phe Leu Phe Ile 945 950 955 960 Asn Gln Lys Lys Gln Trp Ile Pro Leu Leu Cys Trp Tyr Arg Thr Pro 965 970 975 Thr Lys Pro Ser Ser Leu Asn Asn Gln Leu Val Ser Val Asp Cys Lys 980 985 990 Lys Gly Thr Arg Val Gln Val Asp Ser Ser Gln Arg Met Leu Arg Ile 995 1000 1005 Ala Glu Pro Asp Ala Arg Phe Ser Gly Phe Tyr Ser Met Gln Lys 1010 1015 1020 Gln Asn His Leu Gln Ala Asp Asn Phe Tyr Gln Thr Val 1025 1030 1035 3 1037 PRT Mus musculus 3 Met Tyr Leu Val Ala Gly Gly Arg Gly Leu Ala Gly Cys Gly His Leu 1 5 10 15 Ser Val Ser Leu Leu Gly Leu Leu Leu Leu Leu Ala Arg Ser Gly Thr 20 25 30 Arg Ala Leu Val Cys Leu Pro Cys Asp Glu Ser Lys Cys Glu Glu Pro 35 40 45 Arg Ser Cys Pro Gly Ser Ile Val Gln Gly Val Cys Gly Cys Cys Tyr 50 55 60 Met Cys Ala Arg Gln Arg Asn Glu Ser Cys Gly Gly Ala Tyr Gly Leu 65 70 75 80 His Gly Ala Cys Asp Arg Gly Leu Arg Cys Val Ile Arg Pro Pro Leu 85 90 95 Asn Gly Asp Ser Ile Thr Glu Tyr Glu Val Gly Val Cys Glu Asp Glu 100 105 110 Asp Trp Asp Asp Asp Gln Leu Ile Gly Phe Glu Pro Cys Asn Glu Asn 115 120 125 Leu Ile Ser Gly Cys Asn Ile Ile Asn Gly Lys Cys Glu Cys Gly Thr 130 135 140 Ile Arg Thr Cys Asn Asn Pro Phe Glu Phe Pro Arg Lys Asp Met Cys 145 150 155 160 Leu Ser Ala Leu Lys Arg Ile Glu Glu Glu Lys Pro Asp Cys Ser Lys 165 170 175 Ala Arg Cys Glu Val Arg Phe Ser Pro Arg Cys Pro Glu Asp Ser Ile 180 
185 190 Leu Ile Glu Gly Tyr Ala Pro Pro Gly Glu Cys Cys Pro Leu Pro Ser 195 200 205 Arg Cys Val Cys Asp Pro Ala Gly Cys Leu Arg Lys Val Cys Gln Pro 210 215 220 Gly Tyr Leu Asn Ile Leu Val Ser Lys Ala Ser Gly Lys Pro Gly Glu 225 230 235 240 Cys Cys Asp Leu Tyr Glu Cys Lys Pro Val Phe Ser Val Asp Cys Ser 245 250 255 Thr Val Glu Cys Pro Pro Val Gln Gln Ala Val Cys Pro Leu Asp Ser 260 265 270 Tyr Glu Thr Gln Val Arg Leu Thr Ala Asp Gly Cys Cys Thr Leu Pro 275 280 285 Ala Arg Cys Glu Cys Leu Ser Gly Leu Cys Gly Phe Pro Val Cys Glu 290 295 300 Val Gly Ser Thr Pro Arg Ile Val Ser Arg Gly Asp Gly Thr Pro Gly 305 310 315 320 Lys Cys Cys Asp Val Phe Glu Cys Val Asn Glu Thr Lys Pro Ala Cys 325 330 335 Val Phe Asn Ser Val Glu Tyr Tyr Asp Gly Asp Met Phe Arg Met Asp 340 345 350 Asn Cys Arg Phe Cys Arg Cys Gln Gly Gly Val Ser Ile Cys Phe Thr 355 360 365 Ala Gln Cys Gly Glu Leu Asn Cys Glu Arg Tyr Tyr Val Pro Glu Gly 370 375 380 Glu Cys Cys Pro Val Cys Glu Asp Pro Ile Tyr Pro Leu Asn Asn Pro 385 390 395 400 Ala Gly Cys Tyr Ala Asn Gly Gln Ile Arg Ala His Gly Asp Arg Trp 405 410 415 Arg Glu Asp Asp Cys Thr Phe Cys Gln Cys Ile Asn Gly Glu Pro His 420 425 430 Cys Val Ala Thr Ala Cys Gly Gln Ser Cys Met His Pro Val Lys Val 435 440 445 Pro Gly Glu Cys Cys Pro Val Cys Glu Glu Pro Thr Tyr Ile Thr Ile 450 455 460 Asp Pro Pro Ala Cys Gly Glu Leu Ser Asn Cys Ser Leu Lys Glu Lys 465 470 475 480 Asp Cys Val Tyr Gly Phe Lys Leu Asp His Asn Gly Cys Arg Thr Cys 485 490 495 Gln Cys Lys Ile Arg Glu Glu Leu Cys Leu Gly Leu Lys Arg Ala Cys 500 505 510 Thr Leu Asp Cys Pro Phe Gly Phe Leu Thr Asp Val His Asn Cys Glu 515 520 525 Leu Cys Gln Cys Arg Pro Arg Pro Lys Lys Cys Arg Pro Thr Met Cys 530 535 540 Asp Lys Phe Cys Pro Leu Gly Phe Leu Lys Asn Lys His Gly Cys Asp 545 550 555 560 Ile Cys Arg Cys Lys Lys Cys Pro Glu Leu Pro Cys Ser Lys Ile Cys 565 570 575 Pro Leu Gly Phe Gln Gln Asp Ser His Gly Cys Leu Ile Cys Lys Cys 580 585 590 Arg Glu Val Pro Pro Ser Ala Gly Pro Pro Val Leu Ser Gly Thr Cys 595 600 
605 Leu Ser Met Asp Gly His His His Lys Asn Glu Glu Ser Trp His Asp 610 615 620 Gly Cys Arg Glu Cys Tyr Cys His Asn Gly Lys Glu Met Cys Ala Leu 625 630 635 640 Ile Thr Cys Pro Val Pro Ala Cys Gly Asn Pro Thr Ile Arg Ser Gly 645 650 655 Gln Cys Cys Pro Ser Cys Thr Asp Asp Phe Val Val Gln Lys Pro Glu 660 665 670 Leu Ser Thr Pro Ser Ile Cys His Ala Pro Gly Gly Glu Tyr Phe Val 675 680 685 Glu Gly Glu Thr Trp Asn Ile Asp Ser Cys Thr Gln Cys Thr Cys His 690 695 700 Ser Gly Arg Val Leu Cys Glu Thr Glu Val Cys Pro Pro Leu Leu Cys 705 710 715 720 Gln Asn Pro Ser Arg Thr Gln Asp Ser Cys Cys Pro Gln Cys Thr Asp 725 730 735 Asp Pro Pro Gln Pro Ser Thr Ser His Asn Glu Ser Val Pro Ser Tyr 740 745 750 Cys Arg Asn Asp Glu Gly Asp Ile Phe Leu Ala Ala Glu Ser Trp Lys 755 760 765 Pro Asp Ala Cys Thr Ser Cys Val Cys Val Asp Ser Ala Ile Ser Cys 770 775 780 Tyr Ser Glu Ser Cys Pro Ser Val Ala Cys Glu Arg Pro Val Leu Arg 785 790 795 800 Lys Gly Gln Cys Cys Pro Tyr Cys Leu Glu Asp Thr Ile Pro Lys Lys 805 810 815 Val Val Cys His Phe Ser Gly Lys Thr Tyr Ala Asp Glu Glu Arg Trp 820 825 830 Asp Ile Asp Ser Cys Thr His Cys Tyr Cys Leu Gln Gly Gln Thr Leu 835 840 845 Cys Ser Thr Val Ser Cys Pro Pro Leu Pro Cys Ala Glu Pro Ile Lys 850 855 860 Val Glu Gly Ser Cys Cys Pro Met Cys Pro Glu Met Tyr Val Pro Glu 865 870 875 880 Pro Thr Asn Val Pro Ile Glu Lys Lys Asn His Arg Gly Glu Ile Asp 885 890 895 Leu Glu Val Pro Met Trp Pro Thr Pro Ser Glu Asn Asp Ile Ile His 900 905 910 Leu Pro Arg Asp Met Gly His Leu Gln Val Asp Tyr Arg Asp Asn Asn 915 920 925 Arg Leu His Pro Gly Glu Asp Ser Ser Leu Asp Ser Ile Val Ser Val 930 935 940 Val Val Pro Ile Ile Ile Cys Leu Ser Ile Ile Ile Ala Phe Leu Leu 945 950 955 960 Ile Asn Gln Lys Lys Gln Trp Val Pro Leu Leu Cys Trp Tyr Arg Thr 965 970 975 Pro Thr Lys Pro Ser Ser Leu Asn Asn Gln Leu Val Ser Val Asp Cys 980 985 990 Lys Lys Gly Thr Arg Val Gln Val Asp Gly Pro Gln Arg Met Leu Arg 995 1000 1005 Ile Ala Glu Pro Asp Ala Arg Phe Ser Gly Phe Tyr Ser Met Gln 1010 1015 
1020 Lys Gln Asn His Leu Gln Ala Asp Asn Phe Tyr Gln Thr Val 1025 1030 1035 4 1048 PRT Gallus gallus 4 Met Tyr Leu Ala Ala Val Ser Ala Gly Arg Arg Arg Pro Gly Gly Asp 1 5 10 15 Gly Gly Gly Gly Gly Gly Gly Trp His Leu Ala Ala Ala Gly Trp Leu 20 25 30 Leu Leu Leu Ala Leu Leu Leu Gly Gln Pro Gly Thr Arg Ala Leu Val 35 40 45 Cys Leu Pro Cys Asp Glu Ser Lys Cys Glu Glu Pro Lys Ser Cys Pro 50 55 60 Gly Ile Ile Val Leu Gly Ile Cys Gly Cys Cys Phe Met Cys Ala Arg 65 70 75 80 Gln Arg Asn Glu Ser Cys Gly Gly Val Tyr Gly Leu His Gly Ala Cys 85 90 95 Asp Arg Gly Leu Arg Cys Val Ile Arg Pro Pro Leu Asn Gly Asp Ser 100 105 110 Ile Thr Glu Tyr Glu Val Gly Val Cys Glu Asp Glu Asn Trp Asp Asp 115 120 125 Asp Gln Leu Leu Gly Phe Glu Pro Cys Asn Glu Asn Leu Ile Thr Gly 130 135 140 Cys Asn Ile Ile Asn Gly Lys Cys Asp Cys Asp Thr Ile Arg Thr Cys 145 150 155 160 Asn Asn Pro Phe Glu Phe Pro Ser Arg Asp Thr Cys Leu Ser Ala Leu 165 170 175 Lys Arg Ile Glu Glu Glu Lys Pro Asp Cys Ser Lys Ala Arg Cys Glu 180 185 190 Val Gln Phe Ser Pro Arg Cys Pro Glu Asp Ser Ile Leu Ile Glu Gly 195 200 205 Tyr Ala Pro Pro Gly Glu Cys Cys Pro Leu Pro Ser Arg Cys Val Cys 210 215 220 Asn Pro Ala Gly Cys Leu Arg Lys Val Cys Gln Pro Gly Tyr Leu Asn 225 230 235 240 Ile Leu Val Ser Lys Ala Ser Gly Lys Pro Gly Glu Cys Cys Asp Leu 245 250 255 Tyr Glu Cys Lys Pro Val Phe Ser Val Asp Cys Ser Thr Val Glu Cys 260 265 270 Pro Pro Val Gln Gln Val Val Cys Pro Leu Asp Ser Tyr Glu Thr Gln 275 280 285 Val Arg Leu Thr Ala Asp Gly Cys Cys Thr Leu Pro Thr Arg Cys Glu 290 295 300 Cys Leu Ser Gly Leu Cys Gly Phe Pro Met Cys Glu Ala Gly Ser Val 305 310 315 320 Pro Gln Ile Val Ser Arg Gly Asp Gly Thr Pro Gly Lys Cys Cys Asp 325 330 335 Val Phe Glu Cys Val Asn Glu Val Lys Pro Thr Cys Ile Phe Asn Ser 340 345 350 Met Glu Tyr Tyr Asp Gly Asp Met Phe Arg Met Asp Ala Cys Arg Phe 355 360 365 Cys Arg Cys Gln Gly Gly Val Ser Ile Cys Phe Ser Ala Gln Cys Gly 370 375 380 Glu Leu His Cys Asp Arg Tyr Tyr Val Pro Glu Gly Glu Cys Cys Pro 385 390 
395 400 Val Cys Glu Asp Pro Val Tyr Pro Val Asn Asn Pro Ala Gly Cys Tyr 405 410 415 Ala Asn Gly Gln Ile Gln Ala His Gly Asp Arg Trp Arg Glu Asp Asp 420 425 430 Cys Thr Phe Cys Gln Cys Ile Asn Gly Asn Pro His Cys Val Ala Thr 435 440 445 Ala Cys Gly Gln Ser Cys Leu Asn Pro Val Lys Val Pro Gly Glu Cys 450 455 460 Cys Pro Val Cys Glu Glu Pro Thr Tyr Ile Thr Ile Gly Pro Pro Thr 465 470 475 480 Cys Glu Leu Leu Val Asn Cys Thr Leu Thr Glu Lys Asp Cys Ile Tyr 485 490 495 Ser Phe Lys Leu Asp Gln Asn Gly Cys Arg Ile Cys Gln Cys Lys Thr 500 505 510 Arg Glu Glu Leu Cys Thr Gly Leu Ile Ser Gly Cys Ser Leu Asp Cys 515 520 525 Ser Phe Gly Phe Gln Thr Asp Ala His Asn Cys Glu Ile Cys Gln Cys 530 535 540 Arg Pro Arg Pro Lys Lys Cys Lys Pro Ile Val Cys Asp Lys Tyr Cys 545 550 555 560 Pro Phe Gly Tyr Leu Lys Asn Lys His Gly Cys Glu Ile Cys Arg Cys 565 570 575 Lys Lys Cys Pro Glu Met Pro Cys Gly Lys Ile Cys Pro Met Gly Phe 580 585 590 Gln Gln Asn Ser His Gly Cys Val Ile Cys Lys Cys Arg Glu Ala Thr 595 600 605 Ala Ser Leu Met Pro Pro Val Lys Thr Gly Ser Cys Leu Ser Met Asp 610 615 620 Gly Arg Arg His Glu Asn Glu Glu Ser Trp His Asp Gly Cys Arg Glu 625 630 635 640 Cys Tyr Cys His Asn Gly Arg Glu Met Cys Ala Leu Ile Thr Cys Pro 645 650 655 Val Pro Asn Cys Gly Asn Pro Thr Ile His Pro Gly Gln Cys Cys Pro 660 665 670 Ser Cys Pro Asp Glu Ile Ile Val Gln Lys Pro Glu Leu Thr Ser Pro 675 680 685 Ser Ile Cys His Ala Pro Gly Gly Glu Tyr Phe Val Glu Gly Glu Thr 690 695 700 Trp Asn Ile Asp Ser Cys Thr Gln Cys Thr Cys His Ser Gly Arg Val 705 710 715 720 Leu Cys Glu Thr Glu Val Cys Pro Pro Leu Leu Cys Gln Asn Pro Thr 725 730 735 Arg Thr Gln Asp Ser Cys Cys Pro Gln Cys Pro Asp Glu Pro Leu Gln 740 745 750 Pro Ser Leu Ser Ser Asn Val Ser Met Pro Ser Tyr Cys Lys Asn Asp 755 760 765 Glu Gly Asp Ile Phe Leu Thr Ala Glu Ser Trp Lys Pro Asn Val Cys 770 775 780 Thr Ser Cys Ile Cys Met Asp Gly Val Ile Arg Cys Tyr Ser Glu Ser 785 790 795 800 Cys Pro Pro Val Ser Cys Glu Arg Pro Val Leu Arg Lys Gly Gln Cys 805 810 
815 Cys Pro Tyr Cys Ile Glu Asp Thr Val Pro Lys Lys Val Val Cys His 820 825 830 Phe Asn Gly Lys Thr Tyr Ala Asp Glu Glu Arg Trp Asp Ile Asp Ser 835 840 845 Cys Thr His Cys Tyr Cys Leu Gln Gly Gln Thr Leu Cys Ser Thr Val 850 855 860 Ser Cys Pro Pro Leu Pro Cys Ala Glu Pro Ile Asn Val Glu Gly Ser 865 870 875 880 Cys Cys Pro Met Cys Pro Glu Met Tyr Val Pro Glu Pro Thr Asn Ile 885 890 895 Pro Ile Glu Lys Thr Asn His Arg Gly Asp Val Glu Leu Glu Val Pro 900 905 910 Asn Trp Ser Thr Pro Ser Glu Asn Asp Ile Ile His Ile His Arg Asp 915 920 925 Met Asn His Leu Gln Gly Glu Tyr Arg Ser Gly Asn Gly Pro His Pro 930 935 940 Ser Glu Asp Ala Ser Val Ser Ser Val Ala Leu Val Thr Val Pro Ile 945 950 955 960 Thr Ile Ala Leu Leu Val Ile Ile Val Phe Leu Leu Ile Asn Gln Lys 965 970 975 Lys Gln Trp Ile Pro Val Ser Cys Tyr Lys Ala Pro Thr Lys Pro Ser 980 985 990 Cys Leu Asn Asn Gln Leu Val Tyr Val Asp Cys Lys Lys Gly Thr Met 995 1000 1005 Val Gln Val Asp Ser Ser Gln Arg Met Leu Arg Ile Ala Asp Pro 1010 1015 1020 Asp Ser Arg Tyr Ser Gly Phe Tyr Ser Met Gln Lys Gln Asn Asn 1025 1030 1035 Leu Gln Ala Asp Asn Phe Tyr Gln Thr Val 1040 1045 5 5601 DNA Homo sapiens 5 ggcccggctg cgaggaggag gcggcggcgg cgcaggagga tgtacttggt ggcgggggac 60 agggggttgg ccggctgcgg gcacctcctg gtctcgctgc tggggctgct gctgctgctg 120 gcgcgctccg gcacccgggc gctggtctgc ctgccctgtg acgagtccaa gtgcgaggag 180 cccaggaact gcccggggag catcgtgcag ggcgtctgcg gctgctgcta cacgtgcgcc 240 agccagagga acgagagctg cggcggcacc ttcgggattt acggaacctg cgaccggggg 300 ctgcgttgtg tcatccgccc cccgctcaat ggcgactccc tcaccgagta cgaagcgggc 360 gtttgcgaag atgagaactg gactgatgac caactgcttg gttttaaacc atgcaatgaa 420 aaccttattg ctggctgcaa tataatcaat gggaaatgtg aatgtaacac cattcgaacc 480 tgcagcaatc cctttgagtt tccaagtcag gatatgtgcc tttcagcttt aaagagaatt 540 gaagaagaga agccagattg ctccaaggcc cgctgtgaag tccagttctc tccacgttgt 600 cctgaagatt ctgttctgat cgagggttat gctcctcctg gggagtgctg tcccttaccc 660 agccgctgcg tgtgcaaccc cgcaggctgt ctgcgcaaag tctgccagcc gggaaacctg 720 
aacatactag tgtcaaaagc ctcagggaag ccgggagagt gctgtgacct ctatgagtgc 780 aaaccagttt tcggcgtgga ctgcaggact gtggaatgcc ctcctgttca gcagaccgcg 840 tgtcccccgg acagctatga aactcaagtc agactaactg cagatggttg ctgtactttg 900 ccaacaagat gcgagtgtct ctctggctta tgtggtttcc ccgtgtgtga ggtgggatcc 960 actccccgca tagtctctcg tggcgatggg acacctggaa agtgctgtga tgtctttgaa 1020 tgtgttaatg atacaaagcc agcctgcgta tttaacaatg tggaatatta tgatggagac 1080 atgtttcgaa tggacaactg tcggttctgt cgatgccaag ggggcgttgc catctgcttc 1140 actgcccagt gtggtgagat aaactgcgag aggtactacg tgcccgaagg agagtgctgc 1200 ccagtgtgtg aagatccagt gtatcctttt aataatcccg ctggctgcta tgccaatggc 1260 ctgatccttg cccacggaga ccggtggcgg gaagacgact gcacattctg ccagtgcgtc 1320 aacggtgaac gccactgcgt tgcgaccgtc tgcggacaga cctgcacaaa ccctgtgaaa 1380 gtgcctgggg agtgttgccc tgtgtgcgaa gaaccaacca tcatcacagt tgatccacct 1440 gcatgtgggg agttatcaaa ctgcactctg acagggaagg actgcattaa tggtttcaaa 1500 cgcgatcaca atggttgtcg gacctgtcag tgcataaaca ccgaggaact atgttcagaa 1560 cgtaaacaag gctgcacctt gaactgtccc ttcggtttcc ttactgatgc ccaaaactgt 1620 gagatctgtg agtgccgccc aaggcccaag aagtgcagac ccataatctg tgacaagtat 1680 tgtccacttg gattgctgaa gaataagcac ggctgtgaca tctgtcgctg taagaaatgt 1740 ccagagctct catgcagtaa gatctgcccc ttgggtttcc agcaggacag tcacggctgt 1800 cttatctgca agtgcagaga ggcctctgct tcagctgggc cacccatcct gtcgggcact 1860 tgtctcaccg tggatggtca tcatcataaa aatgaggaga gctggcacga tgggtgccgg 1920 gaatgctact gtctcaatgg acgggaaatg tgtgccctga tcacctgccc ggtgcctgcc 1980 tgtggcaacc ccaccattca ccctggacag tgctgcccat catgtgcaga tgactttgtg 2040 gtgcagaagc cagagctcag tactccctcc atttgccacg cccctggagg agaatacttt 2100 gtggaaggag aaacgtggaa cattgactcc tgtactcagt gcacctgcca cagcggacgg 2160 gtgctgtgtg agacagaggt gtgcccaccg ctgctctgcc agaacccctc acgcacccag 2220 gattcctgct gcccacagtg tacagatcaa ccttttcggc cttccttgtc ccgcaataac 2280 agcgtaccta attactgcaa aaatgatgaa ggggatatat tcctggcagc tgagtcctgg 2340 aagcctgacg tttgtaccag ctgcatctgc attgatagcg taattagctg tttctctgag 2400 tcctgccctt 
ctgtatcctg tgaaagacct gtcttgagaa aaggccagtg ttgtccctac 2460 tgcatagaag acacaattcc aaagaaggtg gtgtgccact tcagtgggaa ggcctatgcc 2520 gacgaggagc ggtgggacct tgacagctgc acccactgct actgcctgca gggccagacc 2580 ctctgctcga ccgtcagctg cccccctctg ccctgtgttg agcccatcaa cgtggaagga 2640 agttgctgcc caatgtgtcc agaaatgtat gtcccagaac caaccaatat acccattgag 2700 aagacaaacc atcgaggaga ggttgacctg gaggttcccc tgtggcccac gcctagtgaa 2760 aatgatatcg tccatctccc tagagatatg ggtcacctcc aggtagatta cagagataac 2820 aggctgcacc caagtgaaga ttcttcactg gactccattg cctcagttgt ggttcccata 2880 attatatgcc tctctattat aatagcattc ctattcatca atcagaagaa acagtggata 2940 ccactgcttt gctggtatcg aacaccaact aagccttctt ccttaaataa tcagctagta 3000 tctgtggact gcaagaaagg aaccagagtc caggtggaca gttcccagag aatgctaaga 3060 attgcagaac cagatgcaag attcagtggc ttctacagca tgcaaaaaca gaaccatcta 3120 caggcagaca atttctacca aacagtgtga agaaaggcaa ctaggatgag gtttcaaaag 3180 acggaagacg actaaatctg ctctaaaaag taaactagaa tttgtgcact tgcttagtgg 3240 attgtattgg attgtgactt gatgtacagc gctaagacct tactgggatg ggctctgtct 3300 acagcaatgt gcagaacaag cattcccact tttcctcaag ataactgacc aagtgttttc 3360 ttagaaccaa agtttttaaa gttgctaaga tatatttgcc tgtaagatag ctgtagagat 3420 atttggggtg gggacagtga gtttggatgg ggaaatgggt gggagggtgg tgttgggaag 3480 aaaaattggt cagcttggct cggggagaaa cctggtaaca taaaagcagt tcagtggccc 3540 agaggttatt tttttcctat tgctctgaag actgcactgg ttgctgcaaa gctcaggcct 3600 gaatgagcag gaaacaaaaa aggccttgcg acccagctgc cataaccacc ttagaactac 3660 cagacgagca catcagaacc ctttgacagc catcccaggt ctaaagccac aagtttcttt 3720 tctatacagt cacaactgca gtaggcagtg aggaagccag agaaatgcga tagcggcatt 3780 tctctaaagc gggttattaa ggatatatac agttacactt tttgctgctt ttattttctt 3840 ccaagccaat caatcagcca gttcctagca gagtcagcac atgaacaaga tctaagtcat 3900 ttcttgatgt gagcactgga gctttttttt ttttacaacg tgacaggaag aggagggaga 3960 gggtgacgaa caccaggcat ttccaggggc tatatttcac tgtttgttgt tgctttgttc 4020 tgttatattg ttggttgttc atagtttttg ttgaagctct agcttaagaa gaaacttttt 4080 ttaaaaagac tgtttgggga 
ttctttttcc ttattatata ctgattctac aaaatagaaa 4140 ctacttcatt ttaattgtat attattcaag cacctttgtt gaagctcaaa aaaaatgatg 4200 cctctttaaa ctttagcaat tataggagta tttatgtaac tatcttatgc ttcaaaaaac 4260 aaaagtattt gtgtgcatgt gtatataata tatatatata catatatatt tatacacata 4320 caatttatgt tttcctgttg aatgtatttt tatgagattt taaccagaac aaaggcagat 4380 aaacaggcat tccatagcag tgcttttgat cacttacaaa ttttttgaat aacacaaaat 4440 ctcattctac ctgcagttta attggaaaga tgtgtgtgtg agagtatgta tgtgtgtgtg 4500 tgtgtgtgtg tgtgtgcgcg cgcacgcacg ccttgagcag tcagcattgc acctgctatg 4560 gagaagggta ttcctttatt aaaatcttcc tcatttggat ttgctttcag ttggttttca 4620 atttgctcac tggccagaga cattgatggc agttcttatc tgcatcacta atcagctcct 4680 ggattttttt tttttttttt tcaaacaatg gtttgaaaca actactggaa tattgtccac 4740 aataagctgg aagtttgttg tagtatgcct caaatataac tgactgtata ctatagtggt 4800 aacttttcaa acagccctta gcacttttat actaattaac ccatttgtgc attgagtttt 4860 cttttaaaaa tgcttgttgt gaaagacaca gatacccagt atgcttaacg tgaaaagaaa 4920 atgtgttctg ttttgtaaag gaactttcaa gtattgttgt aaatacttgg acagaggttg 4980 ctgaacttta aaaaaaatta atttattatt ataatgacct aatttattaa tctgaagatt 5040 aaccattttt ttgtcttaga atatcaaaaa gaaaaagaaa aaggtgttct agctgtttgc 5100 atcaaaggaa aaaaagattt attatcaagg ggcaatattt ttatcttttc caaaataaat 5160 ttgttaatga tacattacaa aaatagattg acatcagcct gattagtata aattttgttg 5220 gtaattaatc cattcctggc ataaaaagtc tttatcaaaa aaaattgtag atgcttgctt 5280 tttgtttttt caatcatggc catattatga aaatactaac aggatatagg acaaggtgta 5340 aattttttta ttattatttt aaagatatga tttatcctga gtgctgtatc tattactctt 5400 ttactttggt tcctgttgtg ctcttgtaaa agaaaaatat aatttcctga agaataaaat 5460 agatatatgg cacttggagt gcatcatagt tctacagttt gtttttgttt tcttcaaaaa 5520 agctgtaaga gaattatctg caacttgatt cttggcagga aataaacatt ttgagttgaa 5580 atcaaaaaaa aaaaaaaaaa a 5601 6 4101 DNA Mus musculus misc_feature (3600)..(3610) n=any base 6 caggggaggg cgcccgcccc gctcccggcc cgactgcgag gaggaggcgg cggcgcagga 60 ggatgtactt ggtggcgggg ggcagggggc tggccggctg cgggcacctc tcggtctcgc 120 tgctggggct 
gctgctgctg ctggcgcgct caggcacccg ggcgctggtc tgcctgccct 180 gtgacgagtc caagtgcgag gagcctcgga gctgcccagg aagcatcgtg cagggcgtct 240 gcggctgctg ctacatgtgc gcccgccaga ggaacgagag ctgcggtgga gcctatgggc 300 tccatggagc ctgcgaccgg gggctgcgct gtgtcatccg ccccccgctc aatggcgact 360 ccatcaccga gtacgaagtg ggcgtctgcg aagatgagga ctgggatgat gaccagctaa 420 taggttttga accctgcaat gaaaacctca tctccggctg caacataatc aatgggaaat 480 gcgaatgtgg taccatccga acctgcaaca atccctttga gtttccaagg aaggacatgt 540 gcctttcagc attaaagagg atcgaagaag agaagccaga ttgcagcaag gcccgctgtg 600 aagtgcggtt ctctccacgt tgccctgaag attccattct gatcgagggc tatgctcccc 660 ccggggagtg ctgtccttta cccagccgct gcgtgtgcga ccctgcgggc tgtctgcgca 720 aagtctgcca gccaggatac ctgaacattc tagtgtccaa agcctcaggg aagccgggag 780 agtgttgtga cctctatgag tgtaaaccag ttttcagcgt ggactgcagc accgtggagt 840 gcccccctgt ccagcaggcc gtgtgccccc tggacagcta cgaaacgcaa gtgcggctca 900 cagcggatgg ctgctgtacc ctgccagcaa gatgcgagtg tctctctggc ttatgtggtt 960 ttcccgtttg tgaggtggga tctactcccc gaatagtctc tcgtggagat gggacacctg 1020 gaaagtgctg tgatgtcttt gaatgtgtta atgaaacaaa gccagcctgc gtgttcaaca 1080 gcgtggagta ttacgacgga gacatgtttc gaatggacaa ctgtcggttc tgccgatgcc 1140 aggggggtgt ctccatctgc ttcacggcgc agtgtgggga actgaactgc gaaagatact 1200 atgtgcctga gggggagtgt tgccctgtgt gtgaagatcc catctatcct cttaacaacc 1260 ctgctggctg ctatgccaat ggccagatcc gcgcccacgg ggaccggtgg cgggaagatg 1320 actgtacctt ctgccagtgt atcaacggag aacctcactg cgtggccacg gcctgcgggc 1380 agagctgcat gcacccagtg aaagtgcccg gggagtgttg ccccgtgtgt gaagaaccaa 1440 cctacatcac gattgatcca cctgcatgcg gggagttgtc aaactgttct ctgaaggaga 1500 aggactgcgt ttatggcttc aaactggatc acaatggctg tcgaacctgt cagtgcaaaa 1560 tcagggagga actctgctta ggcctcaaaa gggcctgcac cctggactgt ccctttggct 1620 tcctcaccga tgtgcacaac tgtgaactct gtcagtgccg cccacggccc aagaagtgca 1680 gaccaacaat gtgcgacaag ttttgtccgc ttggattcct gaagaataag catggctgtg 1740 acatctgtcg gtgtaagaaa tgtccagagc tgccatgcag caagatctgc cccttgggct 1800 tccagcagga cagtcatggc tgtcttatct 
gcaagtgccg agaggtccct ccttcagccg 1860 ggccacctgt cctgtcaggc acatgtctgt ccatggatgg ccatcatcat aagaacgagg 1920 agagctggca tgatggttgc cgggaatgct actgtcacaa cggaaaggaa atgtgcgctc 1980 tcatcacctg tcccgtgcct gcctgcggca accccaccat tcgttccgga cagtgctgcc 2040 cgtcttgcac agatgacttt gtagtgcaga agccggagct cagcacccct tctatttgcc 2100 acgcccccgg aggagagtac tttgtggaag gggaaacttg gaacattgac tcctgtacac 2160 agtgcacctg tcacagtggt cgagtgctgt gtgagacgga ggtgtgccca ccattgctct 2220 gccagaaccc ctcccgcacc caggactcct gctgcccaca gtgtacagat gaccctcctc 2280 agccttccac atcccataat gagagcgtgc ctagctactg caggaatgat gaaggagata 2340 tcttcctggc ggctgagtcc tggaagcccg acgcctgcac cagctgcgtg tgcgtggata 2400 gcgcaattag ctgctactct gagtcttgcc cttccgtggc ctgtgaaaga cctgttttga 2460 ggaaaggcca gtgttgcccc tactgcttag aagacacaat tccaaagaaa gtggtgtgtc 2520 acttcagtgg gaagacctat gctgacgagg aacggtggga tattgacagc tgcacrcact 2580 gctactgcct gcaaggccag accctctgct cgaccgtcag ctgcccaccg ttaccctgtg 2640 ccgagcccat caaggtggaa gggagttgct gcccaatgtg cccagaaatg tatgtgccag 2700 agccaaccaa tgtacccatt gagaagaaaa atcatcgtgg cgagattgac ctggaggtcc 2760 ccatgtggcc caccccaagt gaaaatgaca tcatccatct ccctagagat atgggtcacc 2820 tccaggtaga ttacagagat aataacaggc tgcatccagg cgaagactca tcactggact 2880 ccattgtctc cgttgtggtt cccataataa tatgcctgtc catcatcata gctttcctgc 2940 tcatcaacca gaagaagcag tgggtaccac tgctgtgctg gtaccggaca ccaaccaagc 3000 cttcttcctt gaataatcag ctggtatctg tggactgcaa gaaaggcact cgagtccagg 3060 tggatggtcc ccagagaatg ctaagaattg ccgaaccaga tgctcgattc agtggcttct 3120 acagcatgca gaaacagaac catctacagg cagacaactt ctaccaaacg gtgtgaacac 3180 gggcagcagc gccgatgagg tcccgaaaga cggagaaaga cgaaaatctg ctctttaaag 3240 taaactagga tttgcgcact tgcttagtgt cttgtgtcag attgcgactt ggtgtccagc 3300 gctaggggct ggactgggat gggctctgtc tacagcactg tgcagaacaa gcattcccac 3360 ttggcctcaa gataactgac cacatgtttt catagaacca aagttttaaa acttgctaaa 3420 gtatatttgc ttgtaagata gctgtagaga catttggggg aggggagaaa gagtttggtg 3480 ggggaaatgg gttgggaggg tagtgttggg aagaaaaaaa 
aatggtcagc ttggctcggg 3540 gagaaagcca gtaaaaaaat aaaagcaatt tatggcccag aggaatttct ttttcctgtt 3600 actntcaaga ctgttggttg ctgcaaaggt agggcctgaa tgagcaggat cacaaaaggc 3660 cttgcaaccg agctgccata ccacaccaga caagcgcatc agaactctcg atagccatcc 3720 caggtctaaa gccacaagtt tctatagagt cacagccgca gtaggcagtg aggaagccag 3780 ggagatggaa agcaacagtt ctcgaaagcg ggttcttgag gatgtattca ctttttttgc 3840 tgctgctgtt ttcttccgtg ccaaccagcc agttcctagg agacacagca ggttgagtag 3900 gaacgaggtc acctcttgac ctgaccactg gagctttgct cgctacgtga caggaagggc 3960 aacggcgaag gacaccaggc atttccaggg gctacacttc attgttcctt gttgttttct 4020 tctgtgctat cattggttgt tcatagtttt gttaaagctc tagcttaaga agaaactttt 4080 gagaaaaaaa aaaaaaaaaa a 4101 7 4010 DNA Gallus gallus 7 ccgcaggcag gaggcgagtc gggactcggg cagcgagcgg ggcagaggcg ccgccgccgc 60 cgcacccagg ggctcgcagc cgctcgcgcg cgatcgccgc tcctccggcg tggacggggc 120 cggcagccgg cgggcgagga acgcacggac cgacggacgc cgcttcccgc aggggacaag 180 cgggcggagc cgccgcccgg agcagaagtt tggctggagc ggccgccgcc gagagaggaa 240 actttcctcg gggggccgga ggcggggcgg cgggaaggag ccgccgctcg gcggcttttc 300 gccggcgcgc tcgggggctc cggcgcggct ccccccggcg cgggccgggc ggcggcggcg 360 gccgggatgt acctggcagc ggtctcggcg gggaggaggc ggccgggcgg agacggcggc 420 ggaggtggcg gcggctggca cctggcggcg gcggggtggc tgctgctgct ggcgctgctg 480 ctgggccagc cggggacgcg ggcgctcgtc tgcctgccct gcgacgagtc caagtgcgag 540 gagcccaaga gctgccccgg catcatcgtg ctgggcatct gcggctgctg cttcatgtgc 600 gcccggcagc gcaacgagag ctgcgggggc gtctacgggc tgcacggcgc ctgcgaccgg 660 gggctgcgct gtgtcatccg cccgccgctc aacggcgact ccatcaccga gtacgaagtg 720 ggcgtctgtg aagatgaaaa ctgggatgat gatcagctcc ttggatttga gccatgcaac 780 gaaaacctta taacaggttg taacataatc aatgggaaat gcgattgtga cactattcgg 840 acctgtaata atccatttga atttccaagt cgggatactt gcctgtcagc cttgaaaagg 900 attgaagaag agaaacctga ttgctccaag gcccgctgtg aggtccagtt ttctcctcgc 960 tgtccagaag attccatctt gattgaaggt tatgctcctc ccggggaatg ctgtcctctc 1020 ccaagccgtt gtgtgtgtaa ccctgcaggc tgcttgagga aggtttgcca gcctggctat 1080 ttgaatatat 
tggtttctaa ggcttcagga aagccagggg aatgctgtga tctctatgaa 1140 tgtaaaccag tgttcagtgt ggactgcagc acagtggaat gccctcctgt tcagcaagtg 1200 gtttgcccac tggatagcta tgaaacccaa gtcaggctga cggctgatgg ctgctgtacc 1260 ctgcccacaa gatgcgagtg tctctctggt ttatgtggtt tccccatgtg cgaggcaggc 1320 tccgttcctc aaatagtctc gcgtggagat ggaacgcctg gcaagtgctg tgatgtcttt 1380 gagtgtgtca atgaagtgaa gccaacttgc atatttaaca gcatggaata ttatgatgga 1440 gacatgtttc gaatggatgc ttgtcgcttc tgtcggtgtc aaggtggtgt ctctatttgc 1500 ttctctgccc agtgtggtga gctacactgt gacaggtatt atgtgccaga gggagagtgc 1560 tgcccagtat gtgaagaccc agtttaccct gtaaataatc ctgctggctg ttacgccaat 1620 ggtcagatcc aagctcatgg agaccgctgg agagaggatg actgtacttt ctgccagtgt 1680 atcaatggaa atcctcactg tgttgctaca gcctgtgggc agagctgttt gaatcctgtc 1740 aaagtcccgg gagagtgctg cccagtatgt gaagaaccaa cttacataac aataggtccg 1800 cctacctgtg agcttttggt gaactgcact ttgactgaga aagactgtat ctatagtttc 1860 aaactggatc aaaatggttg ccgcatttgt cagtgcaaaa ccagagaaga gctgtgtacg 1920 ggcctcatat caggctgctc cctggactgt tccttcgggt tccagactga tgcccacaac 1980 tgtgagatct gtcagtgccg tccacggccc aagaaatgca aacctattgt atgtgacaag 2040 tactgtccct ttggatactt gaagaacaag catggctgtg aaatctgtcg atgtaagaaa 2100 tgcccagaaa tgccttgtgg taaaatctgc ccaatgggct tccagcagaa cagccacggg 2160 tgtgttatct gcaaatgcag agaggcaact gcttctctta tgcctccagt taaaacaggc 2220 tcttgcctgt caatggatgg gcgccgacat gaaaatgagg agagttggca cgatggatgc 2280 agggaatgtt actgtcacaa tggacgggaa atgtgtgcct tgatcacctg ccctgttcct 2340 aattgtggca accccaccat acatccaggg cagtgttgcc catcatgtcc agatgaaata 2400 attgtgcaga agccagagct cactagtcca tcaatctgtc atgctcctgg tggagaatac 2460 tttgtagaag gagaaacgtg gaacattgac tcttgcacac agtgtacatg ccatagtggc 2520 cgagtcctgt gtgagactga agtctgtcct cctctgcttt gtcaaaaccc cactcgcaca 2580 caggactcct gctgtcctca gtgcccagat gagcctcttc agccttcttt gtcaagcaac 2640 gtcagcatgc ccagttactg caaaaatgat gaaggagata tttttctgac agctgagtcc 2700 tggaagccca acgtctgcac cagctgcatc tgcatggacg gtgtaattag atgctactct 2760 gagtcttgcc caccagtatc 
ctgtgagaga cctgttttga gaaagggaca gtgttgtcct 2820 tactgtattg aagatacagt accaaagaaa gtggtctgcc acttcaatgg aaaaacatat 2880 gcagatgagg aacgctggga cattgacagc tgcacgcact gctactgcct gcagggccag 2940 actctctgct ccactgtcag ctgcccaccc ctcccctgtg ctgaacccat caatgtggag 3000 ggaagctgct gtcccatgtg cccagagatg tatgtacctg aacctactaa catccctatt 3060 gagaagacaa atcatcgagg agatgttgaa ctagaagtgc caaactggtc aacaccaagt 3120 gagaatgaca tcatccacat tcacagagac atgaatcatc tccagggaga gtacagaagt 3180 ggaaatggac cacaccccag tgaagatgcg tcagttagct ctgttgcctt ggtgacagtt 3240 ccaatcacga tagctctgct agtaatcatt gtctttctgc tcatcaatca gaaaaagcag 3300 tggattccag tgtcctgcta caaagctcca acaaagccat cttgcctaaa caatcaactg 3360 gtgtatgtgg actgcaagaa aggtaccatg gtccaggtgg acagttctca gaggatgcta 3420 agaattgcag acccggattc aagatacagc ggattttaca gcatgcagaa acagaacaac 3480 ctacaggcag ataatttcta tcagacagtt tgaagaattg cacccttacc aaggatttaa 3540 gaatgagaga gagatagaga gagagtgaga gagagagaga gagagagaga gactaagtct 3600 gctattaaaa taaagctaga attgtgcact tgcttagtgg attgtattgg gtttgtgact 3660 acatgtacag ctctaagacc ttactgggat gagctctgac tcctgcaatg tgccagaaaa 3720 agcattccca cttttccttg agataactga ccaagtgttt tcttagaacc aaagttttaa 3780 aacttgttaa ggtgtatttg cttgtaacat agctatagag tttttttggg agggggaatc 3840 aggggctagg ggtaggcaat gaggtaaaag gaagtttcag agaagaaaag ctggtcatgc 3900 ttggctggag aagaaaatga acaaatactg ccaagcagca ggacagtggc ttttcttgtt 3960 gtcctgaaat tgcatctgtt gcagcaaatc ctaaccccag gtaaaaaaaa 4010 8 41 PRT Artificial C-terminal CR1M1 peptide 8 Arg Val Gln Val Asp Ser Ser Gln Arg Met Leu Arg Ile Ala Glu Pro 1 5 10 15 Asp Ala Arg Phe Ser Gly Phe Tyr Ser Met Gln Lys Gln Asn His Leu 20 25 30 Gln Ala Asp Asn Phe Tyr Gln Thr Val 35 40 9 22 PRT Artificial N-terminal CR1M1 peptide 9 Lys Val Cys Gln Pro Gly Tyr Leu Asn Ile Leu Val Ser Lys Ala Ser 1 5 10 15 Gly Lys Pro Gly Glu Cys 20 10 22 DNA Artificial Crim Primer 1 10 gctcagcacc ccttctattt gc 22 11 21 DNA Artificial Crim Primer 2 11 gtgatgagtc tcgcctggat g 21 12 18 DNA Artificial Crim 
Primer 3 12 ggaatcttca gggcaacg 18 13 22 DNA Artificial Crim Primer 4 13 cacagcgggc cttgctgcaa tc 22 14 22 DNA Artificial Crim Primer 5 14 gccggagatg aggttttcat tg 22 15 21 DNA Artificial Crim Primer 6 15 ccgccagagg aacgagagct g 21 16 21 DNA Artificial Crim Primer 7 16 ctaccaaaca gtgtgaagaa a 21 17 20 DNA Artificial Crim Primer 8 17 tggtcagtta tcttgaggaa 20 18 20 DNA Artificial Crim Primer 9 18 ctcgctgtcc agaagattcc 20 19 20 DNA Artificial Crim Primer 10 19 ggttgccgca tttgtcagtg 20 20 50000 DNA Homo sapiens 20 gaattcagca gtcgtcgccg cccccccccc ccatgactat ctttttatac cttgtctaaa 60 tgatttctac aaagtttgta tattttgagt aatgttttag gtctgatatt attttgcatt 120 gatttgtagg aattataaat gtaaccattt atctgtgaac aaataattaa aaatacttaa 180 tttggggagg aggcgggaac ctagatgtgc tataatcagc agtgtgcact ttgcttcaca 240 ttatcttctg ggaggaaatt ttgctcctag ctcctgagtg tgcacactga tgaccactgc 300 aggatttggc ctggcttgag caaagaaaca gctcgacgcc atgggggcta tggttccaac 360 ctgctgctgc agttggtata aactaggcaa aaccagaagt cagaggatta tccattcctc 420 ttgttctttt tctagctcta cctgttcttc tgccattctt ctgcctgaaa atattcatga 480 gcatcgccta tgggtcttgt ctctgtgctt tacagtagtg acatcacttg tgtaagccag 540 tttttctgtt agtcgtcggt acctcctagg ttcgctgtgg gggttaagtg agataaaact 600 gtgcttctga aattttagtg tgcacacaaa tcacctaggt atttaaaatt gcaagtctga 660 ttaagttagt ctggggtgat gcctgtgatt ctacatttct aacaaatttc tggctgatgc 720 cagtgtcaca ggtgccagga ccacactttt aagtagcaca aagctagtac ctatgtatac 780 agcatgtatc acagagtatg catattaaaa gacctgataa atgttacctg ctattatagt 840 cattattttt cagccaaaat aaataagata tggtccttgt tttcacacag gccatgatct 900 gatggtaaac tggaaaagca gacaattcat tacaatcagt tcagtaagcc atgagataac 960 ggaaatgcca agcatgtcag gagcaagtaa gagggagtct aacccaaatc tgaccaccag 1020 gaaaagttta gtggaggatg cagcaacaag aaagacctga gggtctcctc tcctccctcc 1080 ttttttcctg cacttcttgt gcaggatgtt ttaggaaaaa gaaaataggg cttggggcta 1140 gacaaacctg cattgaaatc ttaaatctgt tattttttta atcgtgagat cagcttgttt 1200 agtttctctg agcttcagtt ccctcatctg tgaaactgga atattaatag ctttctatta 1260 
agtaaatatt gtagtgagga ttagagattt atatatatga ttacatagat gtatatataa 1320 gattgtgtgt gtgtgtgtat agtatgtatg tgtatatgct agtttctggt accaaataga 1380 ttttttaaaa taatattaat tttaatttcc attctatttt ctatcctttc ccattttcct 1440 ctttgttagc tttctctgta agtttaaaaa tttaccttgt gggcacacat ttcctccaaa 1500 cttaactaaa tctcttcaaa ggggtgggag tttcgagggg gaattcaaat gtcagatgtt 1560 tgtcaaagag aaaagaaccg tgaagtaccg agaacagggg gctgggaata cagggtctat 1620 gcaccaagga aacatgcgtt ggtttgatcc agtgcaacta cagttcccag cgcctttggc 1680 atttcagcag aatgtgaaca cttgcttgat tttgtggttt gagcccctac atccaaagag 1740 ctatttcaaa ttaatattta tttaaaaatg gccttttaat attaatggca tatggtttta 1800 tgacagtgaa ttaaatttcc ttttattgtt agaacttttt gaggatttct ggacatcact 1860 ggagacgttg gagaccccaa ggatatagcc aggccaggtt gatatatctc actgaggtta 1920 cagaaggaag cagcggcaag cctagccgat tcagagacca ctgcgggctt gggaatgagc 1980 aggagagggt gtgagggcca gaggccatct gtgttgcttt tctctctccc agtcattcta 2040 gccatctttg atgtggacta atagtcgagg ttggccatgg gtttgcactg tctctaggga 2100 gtctttgtga tatagtgcgg ggtaggaaga acacctagaa tcaaaatatc taatgttact 2160 agctgtttag ttcttagtgt gggtatggtc ccattttagg agaacagaac tatgaactat 2220 gtttgtgttt tttttttttt tttgagacag agtctcactc tgttgccagg ctgaagtgca 2280 gtggcatgat ctcagctcac tgcaacctct gactccctgg ttcaagcgat tctcctgcct 2340 cagcctcctg agtagctggg attacaggca tgtgccacca cgcccaacta ctttgtgtat 2400 ttttagtgga gaccacgttt caccatgttg gccaggatgg tctctcgacc tcgtgatctg 2460 cccaccttgg cctcccaaag tgctgggatt acaggcgtga gccaccgcgc ccagcccaga 2520 actacatttg cttaactgtg actctttcct ttgtacctga aaataagtat catttttatc 2580 cagtaaatat tggctgataa ttcataaggc agatgcttta aattggctat tgtagaacaa 2640 ggagaaagaa cttgatgaag agaaaaatcc ttggaatcag tggtttacaa cttttttttt 2700 ccttcctgat gatcaaaccg tattatcgca tagtagaaat tggattttct tccagaaact 2760 actttagtac ctctcatttg tagcacttct atcaagagtg ctggtattcg gacatgtaac 2820 ctacaaacaa aatcagaaag atgtcaaaat aacaccctaa ttcagactat cctgagaaca 2880 tttgtcttct gaatatttaa gaaaaaaaaa taagtgttaa ttatattgta gggtgttacc 2940 attttgtatt 
tcaagttcct gagaagagaa tttgaacagt ttgctatttg gaaattttag 3000 caaccagcta ccttgcctat ggaaagatta aaaaaaaaag gagagggagt agatggtggt 3060 ccataggctg taactggaaa ctatgcctgt cttatttagc atttcaaaac aaaaaccata 3120 aacaaacatc tcagattcaa aacagaaagg aatttctggc agggcagaat gaaacctgag 3180 actgcagagt cccagtctac aagaggtaac ttatttagct tcttatccaa aggccacagc 3240 aaaacgtttc acagctgtga ttaaacaaaa ggcatctttt gttggagaat tgtgatactt 3300 ctttttaaga ctgcgaaaga cctgccaaaa taaggcaaag ctaccttgag gatgtttgac 3360 ctgaactaga ctttatgaag tagttggacc acttcaggcc ttatggtgtg aggagtgtgg 3420 gatgcatagg cttggacctt tctgtgtgtg tgtttataag tctgtagaga cagaatattc 3480 catcatgtta tcaaatgcag atctcgaaaa ccttttctca cttcccatat ttgggagctt 3540 tgcttttttt gaaaagtatg ggcttttaca gtctgagtga ggaaaaggga tggattttag 3600 acagttttgt ctgtggaagt caaagtagaa ggatttttaa acagttggag acagaatggg 3660 tatactgtgt gcttttcaga accactgagt tcctatattt caggcagatt ttaagaaaac 3720 ttattaccac tacctatcac atcaaaatga aagaccaaat aaatttctgt gttttaatca 3780 agactgtgac tgtatttgtg tatatcctag cttgtttcag aaagggtttg aggctgccca 3840 gaaaggtgaa tattacagaa ttcttgtatg tgataacatg ataaaggaga aaaaaacagc 3900 catggctttt gaatccccct tcatctctta atagcccagt gacttaaata aaacccttca 3960 acttcccttg gcatgacttt gccttaattt gtcaaaaggg aattaaatag ttacctacct 4020 caagacaggg gaactgaatc agatgatatc ttaaggttct actggtcata agatctatgc 4080 ttttatgacc tagaagtgat agacaggagt cgaatattac cttgtgtttg cagagccagg 4140 gaaggaaaaa gtgatagtgt ctacatgact gccaggtgct gtagacagaa gggcttagag 4200 gaaaagatca agagttcctc catggttatg tttatcaagc ggcagtggag acagtaaagc 4260 gtagtattgg gaagttagtg acagctttta aatgcctgaa gtcttttaaa tcatattcac 4320 cgtggctgag tctggattca gggtagaact caaacctctc ttatgtgttt gctgccttcc 4380 ctttctccat tttcttatgt acttcattaa ttattaagac tatgaggtag gtaggtgaca 4440 gagactgcaa agggagaagc tagagcgaag acaggccaaa acttgcccaa attcacatgc 4500 tcatttctgc ctctgacact aggagagagt tgagtaattg ctcttcttcc taatactgca 4560 ggcagcagcc aaattctcca cattctttta cctttgctgc tgttttcact tgccagggag 4620 tttccttaag gagcagactg 
aatttaaaga aacatagcag taatagtgca tgaaaggggg 4680 tgtggggtcg ctggccaaac gtggcatgtt ggtgcctcac gttgggagga tgaaaggatg 4740 attcctgcag agggacatgc tgcacactgc gcaagctccc tcacctctgt gtacagctga 4800 gccactgacc aggaagctct ttgcttgtat gtgagacagc aaatattcac taaatatgtt 4860 tcaagtacaa gccaactgcg aagcttggaa agttgtccag aaacttcacc cttcctttgc 4920 atgttgtcca gcggcaggac gttgagcaag gttaggtagt atgaacctgc ggctctaaaa 4980 cttaaagtaa taggcattat tcgaaagaaa atgatgccca gcacggcttc cagtgtatat 5040 tagcaatcag cttgtttgcc ttgtggttat tacatttttt gccaagagca gccttccaga 5100 atgaggtact agttaattcc cattgggcat ttattcatta ataagtctgc tgaatgtgaa 5160 atattccatt tgggagtatg tcatatgaga aaactcagtg gctaattatt acatagctct 5220 cttccatgat tctcactttc agcgcagtta cctaagccac catttcctag gtatgtaaac 5280 ttagataatt tttagttttc tgtaatactt tttgcttttt gctccagaaa attttttttc 5340 actccaatct gtgaaattgt taaaagttta ctaataactg gataaacatt ttatgttgta 5400 ctatggcttg ctctttcttt ccattttaaa aagataatat caaaagttaa atcctgttcc 5460 agatcttacc gtctctcttc caggctagat ttctgggatg gggtggtgtt aatttactaa 5520 cctgagcttt tgcccaatct agggtctcct ttactctttt ctggtgtata aataggtctt 5580 tatttgcttt tcttccaggt tggttggtgc atattagctt aagaactttt aaacctctgc 5640 ggcagtaatg agaatctgca tgccaagggc ccagcatttc tctacaaagc agcccatgag 5700 tgcatgtgtg tttgctgtag tgaagggtct atgttagaag gtgagttaga gaccccactt 5760 gcccacatct gtgtgctgtg gtttggccct cctcctctgg tgaatcccta ccacagccca 5820 gtgcctgctg gatctcaagt gtcctgaaag ccattggaaa gatgctttcc agcagtctta 5880 acatttgcct ccatgttctt tccagcaagg atgagtgcac acagctttcc agctactggg 5940 caacccagcc attgacagaa ggtggtgcaa gagtcaattc ctctgtgagg atcccttcca 6000 ataacagtgt ctgcattaac gacaaaaagt tctgtaaaaa tgtttgcagg gggagctttt 6060 gacctgcaaa catttctaca gaactttttg ccatatagga acatatgaaa tgttatatgt 6120 aatataggtc agaaatgttg ccatatagga acattcgaga agatgataaa tgtgaggaaa 6180 gtgataaaag tttatacaac aaggatctgt acaagattgt ggtagatttg ctggcctaaa 6240 aagatgtact cagggtagtg aactatttct gtagcatcag ccactgggcc cctgagtggg 6300 aatactcttt gagagacagg aagctaattc 
cttctttgaa tgtgtagtta actttgtgcc 6360 attggatgta gggtgtgcac gtgcatgtgt attttttcag ttgctgaggg cacaacacag 6420 acaaagaata ccgtgcattg tattatggat gtgctggaaa caacgcccca gcttgaagca 6480 agttgactgt agtatctact ctgtagatca gcaaactgaa aatgacttct taataaaagg 6540 tcgcacattg ttttacttac tgcatttgaa aaaacagttc taattaagat gctttgggtt 6600 ttaaactgcc tgttaattgg gatatctata agtgggcaag tagagcttct gaatattgct 6660 gacttatttc ttagtgattg ggacaaaagc tttttttttt tttttttaaa tagtatgcca 6720 tcattaaaca gtttttgacc ttacgcagaa actttaaatg tgacagcaga aagaagcagc 6780 agaaatttac agtgagaaat ggaagacctc tgtaggtaaa taagtaaaat gaaagaagcc 6840 agaggtgttt tttgttggca gtagatgaaa ttattttcaa acactgccct gggaaaaggt 6900 gagccctcag caaccaagtt tattgactgt ctattcagga aattgtatta gttgctttgg 6960 aaagagaaag gaaaaagaaa agactttctg ccaactgagg gctggaaaac ttgagggaga 7020 taaagtattt acatgcttta ataacataaa gcatgtccaa aagaaataca taattacagc 7080 tatgtttggc actgaaaata atccaggtta ttatattttt aaaattgtgt gtgaatatgt 7140 aagtgaatgc ccttgagtca gaaatgcttg agtgataata tttcttgaga gaatggaaaa 7200 gtagatggta ttaggagcta aaaagagaga gaagggataa aggatgacca ctagttaaga 7260 aggaacagtg aacttttttt ctttagaggg gacatatttc atatgcatag ttattccccc 7320 tctacttcta atcttagtat aataagcatt taacttccca taggacctta gcagtctgca 7380 tgttattact aaaagcccac aggtaaggga aaattagacc cctaaattaa agacctgacc 7440 atcttgctcc tgaagtttta ggttttccaa ctttatttcc agtgtatctt ggacaaagac 7500 cctttgagag ctcagaaaat taggagatgt ggtcaacaca ttcttcttaa tatctagtgc 7560 aaccaaagac ttggccagca actaaggcca gcaactggaa aatgggtgtg gtgtcaggtt 7620 tcagacgccc ccatagttgt cttcacgtcc tttaatatgt gttggcatcc ctcaagccct 7680 gccagtttat gctctttcct ttgtgaagcc ttcctgccct gcccaggaaa aagtaatgcc 7740 atcccccatc ccccatcccc cagtgaccat aaatagcact ttctccttcc cctgttatag 7800 cacttttcat actgtgttag agttcacatt tctgtctctc tgcctcattg tgatctattt 7860 gagtgccagg attgtgtgtt attctagcgc ttggcacaca ggaagaactc aatgagcatt 7920 tgttgactaa atctagactc aaggtcatcc aagatattat acaaaaactt tttcatcgaa 7980 ccatttggtc tgcattgtgg aaactgaaac gaatggcact 
ccccgttaag agaatcccag 8040 tgagaagcca tagcattgca ggaggaagct caaactggtt aactttgctc atgccgatca 8100 gtcttagaat tttttgtaat gtttccatga tgaattagtt tcagctgtat ttttttggca 8160 aatatttaaa gataattttt atttgtgtgg acttttcttc ttgtcagttt gtcataaagg 8220 tgtcatcttt cattcactaa ccccttatca gcttatagga attttcctcg tgggtggttg 8280 cttgtaacct tctcttgact gccagggatc tgatatacta aacggttgag atactgtggt 8340 gatcagctgc ttgtaaacaa taactgataa atcggttata aattatcaac acacttcaga 8400 tgaccatatc atcatgactt ggtgatagca aaatagactt ctgaattctc agctaatcca 8460 ggattgtgtc ttaacaaaca tttatgttga ttaatcagga gggtgtttag tttaagaaac 8520 atgtactgaa tgttcaatat gtgagtgcca agtgcagtgt ggtgcaacct gtttactgct 8580 tcaggtgatc tccagatatc ctaaggaatt tttaactgta cagcttcttt ggggaaggaa 8640 atcctaagcc aacaacctga gggtggaagt gtgtgactgg gaataattaa aatgcctcac 8700 aatggttcga gcaggaattc cataggaatg cactttaagg catttaggaa agcaaaattc 8760 cgtttggatt tactccaaac cctatgactt cagagacagg aatttttgat gattagtggc 8820 tggagcccca acttgttatt tagtaattcc acattcagtg tggaattgtt ttttctgtag 8880 cactagttat aagttctttg tttgtttttc taaaggtttt aaggaggttt acaaattaag 8940 atggaataag acattagaaa cttaaggata catagcaaat taaaaccttg atacatttgg 9000 tgctaagctt ttcagctgct aaagatagag agttttggcc aagataaagt atatagagtc 9060 aaagacacat gcagaattct cctggatata agttcaggta cctgtgccat tttatcactt 9120 tagtcaatgt gcccattttc ttagggcagg tttaattttc tgctgcggca tggcccttca 9180 gctttcccaa gtttacgatg ctcatctcat acctgatgga aattgttgat aatgttagtg 9240 tatggttggt ggctttccca gtagcccact tcagatcctt ttattggtca tcaccagtta 9300 caaaagtttt aggaaaggga aggagagaag ctggaactga tggctaagta aagttatagc 9360 tgtagagctc tggttgtcca taatgaacag tggcaatgaa cagtgaagat ttaagcaagg 9420 ttattttttg agggtccact gatgattttc ctataactcg ttaaaaatat ttttaaaact 9480 tttatatggc ttcatataaa aaaagcttaa aaatctgttc ccagacttcc taggcaggta 9540 tgttttcttt tcagtgttgc cttctctaag actttatcag cagaaggagt atttgagctg 9600 atggcaaatg ctgccaaaat gagaagtgtg tgtgtttctc aaaactcatg atcccacagt 9660 taaatcgagg gtgtgggtaa aggacccaga actgttcttg ctgatggaat 
agcacacttt 9720 tcctcagaca ttagtagagt aaatgaatgg aggcagagga ttgggaaaca ctcttgaagg 9780 agggagtgcc catcagcctg taagccaaag taaacagttc ttgtaactga aaaatagact 9840 tcctggaagg agagaaccag agaagtgggt gtgaagtgca accagcaagt accattttca 9900 gtgatacaga cagctggctt aacctgagtc ctgacaagaa cctctcagag ctatttgggg 9960 aaatcagttg cttgaaccag cctttgactg gcctatcgtt cctacagctg tacccaaaga 10020 tggcacttgg ggagaaaagg tcacatttgt ttgtgataga gaaagcaaac cgatggggca 10080 ttttgagagg tagaaatgaa gtcatcattt gcggagccgg tggaccagca gaatggccgt 10140 gggaaggtat ccctgaacct gtatgtgtgg aaatgcaggc attctctgtc agtcttttgt 10200 cacagtgaga aaatgtcatt acagtggaca gctatgcaca cgttggagat tatgtgatgt 10260 gtgcatgtgc atccatgtgt acaacataca cccatgcaaa aattgctcac agggtcatgt 10320 gcgtgtcctg gtagtctgtt atgtccattc ttctgtaata ggagtatgtt tttcaaactg 10380 aagctgttag gttctgttaa ttccaacaga acaaaaaaaa atctgcctct ggccatcatt 10440 cgttaattta catagattag atacatgtag tctgtctcct aagtggctta cataaagccg 10500 tgggcgtctt ttcctgtgtc tcctcttcgg cagttcctaa ggcacctacc aggtttatag 10560 ttgctgatgt cctacccata ctatgtgtca agaaaggtag actcaaaaag gattgatgca 10620 gatattcctt agaagttttc atgcccaaag attcttggac cttagttgca tgagggaaga 10680 ggccttgcct cgtgaaagga gaccgaagcc ccttattaaa ggggtcactc agaaaaaggt 10740 gttatctaaa ttccctcttt gtagttttat ttcttcaatg tattccccat cactagctgt 10800 gtgtgctgga gccaaatgtt gggcccactg ccatctttta aagaatgcta ctgtaaaggg 10860 ataactttac tagacagttt gggaaataga agaaacaatt atctaaaatg ctcttgttcc 10920 aaacacaaaa actagcttca tgtttttgtg tataaccctg cctagtctta ctctgtagac 10980 atccttcccc ccacacccat cgtgtatgtt ttttatcccc atctaacatt atatcaaata 11040 cattgtccta tgttgctacc ttgtcttcat tattatgatt tgtcaagcaa ttgggttgta 11100 ctgactgcag tgttactcct catttccttt ccccctttct tttccatact acttaaaaag 11160 caagaaagga aaaaaaattc tatttaagag aaaatcctat gtagtcgtaa tggaatttat 11220 tttcctgagt gcctgggcgt gctccaaaca gatagctgtg attgtgtcat ctcttcttac 11280 taagcatctg accactcact acctgtgcat ctgatcccta ggctcatggc atgagggctg 11340 tggctcttga tgtttttttt tttgatgtgc ttagtatcac 
cagaactgta ttgactctaa 11400 gagtatggaa gctttgtaaa gttaactaac gatttgtagg ttttgtgcaa atagccagct 11460 ctccaggtag agacatttta attctgttgc tttttttttt tgtagcatat tattatattg 11520 cagatgttta cttagtatga aaaatggata taattaagtt ctaaatgggt actagatatt 11580 aacttcaaaa tgaatttgca acttgtatgc tttgcatatg ccgtatgctt tgcatatgcc 11640 atatgctgtt gtttccaagg atgtaaagca gttttccact tccaccatct tataaattaa 11700 aaccagaggg ctagacgtta tccagcaagg gacctacaaa attcgaaaaa aaacttcgga 11760 tcattatcag gattaagttt gttgaaattg tatcaggaaa atagactttt ccttgtatta 11820 ttattgactc atataatgca actcatactg gtttcatatt cagaaggtat atagcacctg 11880 ttgtaatatt tcaggttgat ttctgcaatc tttatttaaa aatacaaatc cgcaaagcaa 11940 attgcataaa cggtatacaa aatgaaatca ccaggcttca aatatatttt atgtaaatgt 12000 taagtgctct ttagattagc aaaaggaaga caaacatttt gattacctct ttaatactaa 12060 atttttgttt ctgaattata aatctttgaa gatataatta gtgtaagata tataccaatc 12120 aacatagtcc agaaaaccct gaacctggct gcttattagt tggaaagagt acctatttag 12180 actttctaag ccacagtatg tggctaagtg attaatatgc actattacac tttcacaaat 12240 actttttgga ccaacatatt gacttaaggg agcatacttt gtactaaaaa actaatttga 12300 aggtgaaatc acagcaaatt taacagttca accttgtgta tttgtagagt tttagttgga 12360 tcataggata atgacattag ctaaagtcag caagcatgta aaataaacat tttgaagttt 12420 aatagtcatt tatgttctac aataataaat atttggcacc caccataacc accactacca 12480 aaatatgtat gctgaaattg tgtctccagg atgaataaag aggtgtaagg caagctgaca 12540 cccattagaa aaaaaggttt agaaagggag agagggcctg gacttcccta agcttaaccc 12600 caacataggg cctgtagctc ctgttggaac caatttataa aacttcaagg tgggaatcaa 12660 tagattgttt ccagtgttat atagttcctc tttaatcagt tgatcttcaa ccaaagcttc 12720 aagtggtttt tgtttttatt tattgccaaa agcttcttaa ccaatgtgat tataaatgtt 12780 taaattcatt tacagtgaaa tggatttgta tcttccttgc tctacagtag atagtgaagc 12840 ctccgtgacc tgtgtaagct gcaggttgag gagaatgttt aaacagtagg cgagcgttca 12900 tacagtgtcc ctggactgca tttccttgct tacagagcct tggagctggg ccaactcgca 12960 tgagcgggag gaggagaggc tcagttttga gagccctggg tttatttctg gctgtgggga 13020 tacgagaatg tgggagaagg 
tgtaaacata gtttgaggga ggtcgtagac ttggaggttc 13080 ttgccagggg taaatgctgg ctggatttgt ttatggatgg atttgattat agctgcttca 13140 cgcttttacc tctggagttg gtgcttccaa ggggtgatgc tcattgtctg gatctccttt 13200 agtgggcatc tcagaattct ttgctgcacc aaccacagca tttcttttcc ctttcttttg 13260 gtatttgtat tcaaatgcct gacttcctga cctgctgtga gggtttttaa aaataatgaa 13320 caccaagctt cttgggaaaa gacaggtatg taaatactgc aagcttatag catttggggg 13380 atggagttgt gggattttgg agtttgtgac ccccatgtgt ctgtgtatca ctctctctaa 13440 tgtagagaga cagagcagca gttcctgagc tctgtctttt gtctcttcat gggttggtca 13500 ctgatttccc ctttcagggt ctgaatccct ttcccttgcc tccccccatg tgggttccgg 13560 aaggcccaca tccaacagct gcccttggtc aacccccagc cctgcacaca ggtgttttct 13620 gctgttcaga gctcaggggc gctgtccagc cagctgatga cagctcactg gtggcgctgg 13680 tctcactcca gtgctggctc tgccctcttg gccctgttgc ttcctccttg gaggattagt 13740 caccccagtc ttggggcatt ttacccacct gtatgactag cacacattgc atccacttgc 13800 caaggagtgt tagggataca gtctcaagaa aacctgccca gtagaaatga cttcctctgg 13860 agaaatttag atgataaatt ccatagggag aaatcatgtt ttattaatga gaaacaaaat 13920 cgtgcactag tgggagttca ggctcagaag ccagtgcata gtccagcctg gcgacctggc 13980 atggtgtgtt accctggcca aagctcccca acctctgttc atgatttccc tctgtaaaat 14040 gggaataaca atagtatcca tttcttttgg gtggttataa ggaatactta agttaactca 14100 ttcatgtaaa gcacttgaat agtggcttgt atacagtaag cacacagtaa atgtacattg 14160 tcattgtatc tttgtacctc aacatctgaa actatggttt atacatagca catactcatt 14220 catgctttgt tgagtaaatg taatattcag aaggtgggaa agtgatttcc ttctctctta 14280 tcataaaata agagttatat ttggttgaaa taaggcatca taggcctgaa gacatttact 14340 ccctggctgc tggcagggga gtagaatgtt ccaaactcta tgttgataac tgggtgccta 14400 tgatttgtta atcataccta acaaactcag tctgtcaggt gtttcagtaa aatttactga 14460 ctcccaaggt aattttgaaa ggacaaagcc atcaggtgat ttgttttcaa taagcattat 14520 tgggatattt cagactccaa catgtgcagt ggagatatta acgagacttt accctttcta 14580 aacttagggg cttactctgc attgtaacgt atgtttgctt ttaaggcgat agatgggtag 14640 atagatgctc tgtaattgaa tagcctactg ggcttctgtg tagcacagca ctgtgtaagt 14700 
cgcaggtgtt cagtaaatgt attttaaatc aatgagtaat tgatgggata tttttcttca 14760 cattacatca cattgaagat gcttgtgagg aagttttgtt catgattgtc ctcctaggca 14820 ggcgccacct aacatgtttc tctctagctg cggggtattg aatttgaggc tgctctcagc 14880 attactgctc atctcaaaga ggaacaaatg gtcccccaaa tggtccagat gcaacagtag 14940 ggaaaggagt gccaccattc attctgaggt gcctgggcca tactctggat gaagctcaca 15000 gctctaacgt aaggcggtaa gtcatttttt ttttaaactt aaggaagtta atttgaaagt 15060 ttatgctgat gccatttcat atgacagaga ctaaagggta tacgtccttg tgttctcttg 15120 gtggggactg ctgtcctaaa ggtaaaagat tttgatccag aggttggatt gttcctcgtg 15180 tggcaacagt cctctgacct cagtcatttt caaagtctaa tgtctttttg gctaacaaaa 15240 gacatcaagc tagagattaa aatttgcaga agatgggagt ttttttgttg ttttcggttt 15300 tttttttttt tttttttgag acggagtctc actctgtcac ccaggctgga gtgcaggtga 15360 tcatatctac aaaggtgaca agaacatgaa cctgcagtcc tgttatcagc attttgtaag 15420 tgcttcttct cattctaaaa atatgtattg attttaaaaa tgaattctgt aacaggatag 15480 caacagtagt aaaaatcctt tttatggaga aatttagagc tttcagctaa catacacaca 15540 tactcattca aatgaaacag cttgtcttgt tttaaagaaa tatttctgtt gtccattgta 15600 caaattttaa taacttgtct ttcctttttt taggtctctg tgttccagat ttttttttgt 15660 aactatgaaa tccttgcctt gtagtcaact taaaatgaaa taaatttatc agtgcaaatg 15720 agtacactgt ggttcttgtg tggcaaaatt atgcaggtgg tttgaaggat tcattgtttt 15780 attaaaagac tagaagaatg tacaaatgct ttccttcttt ctcatgtttc tctcacccaa 15840 actgaagtgg tccatttaaa atctgatctt cataaaagga agagttgtca caggacatct 15900 gatttatgag ccatctgatt tgcttctgct gtagttccac actctgctgg ggtacaaaaa 15960 tcaggcactt ttatttcagc cctgacaaca gtgcggcctt cccaaagtat gagaagagca 16020 gcgtgcagag gtaatgtgca gtcttccgaa cagagcactg aactggtttt taatgaaggg 16080 taatatgtac ttggggtcca aactctagtt aattccggaa aacatatcct ctcactagtg 16140 ttgggtctta gtaaatgagt gttctgtgtt gaatccttgc ttgagaggaa ggaggtcata 16200 attggtcagg gattctattt ctttctttct ctcttttttt tttttttttt tttttaaaga 16260 aaagcaattg cttagaatga gatgaatcta tatcagggta tgtttttgta ggatcatatt 16320 aatcatctta cccacagact gactttgtgg tcaaacttta ctcataagga 
tttattaaaa 16380 gcaatggttt ggttaaaaaa aaaaaaaaaa ggtatgttta tgggaggaat gtagacattt 16440 agttgaagta cagttgatag actttgacca ctcagggata ttttctccat ggactgtctt 16500 acagaggcag agcctaactg catatgcttc ttatgactct gatgcatgaa tggcagccct 16560 aaatgggctc ctcttcactt attccttctc cccaatctcc ccagcactgt gttctgtttc 16620 cttccctgca agagtctgcg tttatcagtc aggtactcct gtcctcagag tgccaggcca 16680 gctctgcatc cttctccaca agaccaggct gaagtctgcc tgttcacctg ttgatgctac 16740 acagtttgag ctcaagctcc ttattctttg ccggcttccc ttttttgagt tccagtggag 16800 tggtactctt gtgtcagagg gattcagcac tgcccactat gtcatggtac tctgcgctgc 16860 tcctcttggg tctggatggg gccacagttg tccagtccca tcatgctgca gccctaagct 16920 ccagagggtg accttgatag gatgcacatc ctggagctta atttcataag ccttgggggc 16980 ctgcaagcag gaggagttgg aagcccttga gaggccagat cctgtgggtt tgagcgcaga 17040 gtctacagag ttaaaacctg cagaacaaag aaatcaaaag aaggacataa agaaagtttg 17100 ggagacattg agtctgcaaa gcaaagaaat ccagtattgc aaggggaaga ttttgctttg 17160 ggaggttgaa gttttgtttg ccctccgaat gaaagtggta tttctgttcg ttagccaaaa 17220 aatcatcaga ctttgaaaat cagtgaggtc agaggactgt tgccacacca ggaatcagaa 17280 ttgtggcatt ggaagtttct tcagaaaagc acctttaggc ccgcccaact ttacagatta 17340 aggaagtgag acccataaaa gtttgtcatg ttctcttatt ttaaaagata tctcgaccat 17400 gtctgcagtt atggcctctt tttcctgcct cacccaaagg ggtcgaggag ctttctcttg 17460 cccactctgg agagaactcc tggggccccc ccgctgctct ccctgcccca tgactttgtc 17520 ccagaataca gaacacagtg ggctggatca gcctctgctg aacttttccc tgccagcctc 17580 cctgcccgcc ttcttgcccc cgcccctacc ccccacacag gaatctagag ccctcccagt 17640 ggtggtgtga agcctctgga atgtggcact ctgacccctg gcagttgagt gtcctggatg 17700 gacctggcct gggaaacagg aaggactcgg gggaggggga aggagtgccc catggccttg 17760 aggtgggatg gagggaaagg aaatgctgcc cagcgcggcc ccacaggcag ctcccgctag 17820 actcccacac tgggggcggg gctggcaggg cagggagccg gcaggtttcc tcagaccagg 17880 cctgtgcccc ggggcctctg gcagagccag gctcaggaag gtgaactgcc agagcagttg 17940 gggtgaggag gaaaggatgt gaggggaagg aggaggagca ggccagaatg cagaagcaat 18000 tgtagtagga gaggacaaag gtgatggatg 
agaaaaggaa tgtccagtga ctgtgtggac 18060 tagacacact tctcacttta gaaaccaagt atattagaaa gcggaaaaga ccctcctaaa 18120 gaaacttcca gccaggtgca gtggctcgtg cctgtaatcc cagctactca ggaggctgag 18180 gcaggagaat tgcctgaacc ggggaggcag aggttgcagt gagccgagat tacgctactg 18240 tactccagcc tgtatgatag agtgagactt cgtctcaaaa aaaaccttcc atgccatttt 18300 tattttagag taaagcacac cccatttgga taccccagtg ctttttaaca tgctcagcct 18360 tggcaaagat agggtcactg gggttagtga aaggggtggg tgcacatgca agtgtgaggc 18420 caactgttac tcacctgtcc ctttctgttt tctttcattg gtggacccac ctggaccaca 18480 cttgggtctg ttacctccct caaagaggtt tccagaaaaa cctgaggacg tgcctcaggg 18540 tgatgatatg cccagcagct ctttgctcct ctccgggatt agggaacctt gtatcctcaa 18600 cttcctccta attaaagcct ttggcagaat caggatgcag aaccagggcc tggaaccgcc 18660 tcagtaaacc cctgtggaac cactagcaga tcatgctgtt tccaaactgc gcgtgttggc 18720 aagccagctt ttcactgggg atgtttaaaa ttaaattcca tgctcttggg aaaggaatag 18780 aaactatagg cagagtctgt ttctgaaaca cctgttttaa gtggaaagcg atcctaactg 18840 ctgccatacc tgaagggctt gtgtttcctg tgcagctgga atctctctgt gctgagtaaa 18900 ttaacttctg tgcccagggt agtcttgaat gataattagt gctccgtggc catgccacac 18960 atcctgtctg aagcaagggt gctaataatg tcctttcctc tcttctcaca agacattata 19020 tgaagtccta ggaatagtgg tcctggtgct aaatagaaat gtcgtggtta tggcactaat 19080 ttattgggcc tgccagccca ggtgtgcagc atccctttcc tccacccgat tgataagaca 19140 ccaggtgagg tccagtgtga tgctccaggc agtgctgcag ctgagttggc ctgtgtctcc 19200 tggctcaagg ggtgcatctt ctggtctgcc ttgagaactt accagtcttc cataaattgg 19260 aagttcaaag caatggaatg aacaatggag agaacaagca catacacata agttaaaagc 19320 tgtgcatctc tgtatattct aatcctttga aacttggtga tcattattgt tcattacata 19380 ggatatggat ttgatcagag catcctttgg gagttgtctg catttggtat ttccttttat 19440 atcctgaaag ctgttagttc ttgaaaggtg tattgaacct aataaaatag taattagagg 19500 tatttccaca aattgtgagc tttgtcattt ggatagtgtt tttgagactt cttttccact 19560 ctaaatacca actttaaata gaataatatg acagaattat gatcttgaaa aatttatcta 19620 cttaactgca tgtgggaata catatttctg tagcatgtgg aaaaaggact gagtgatgca 19680 tacctgttat 
atttacattt tcttttggaa gcaaattatt ttctacatgt gtatgtttat 19740 gcctacatgt gttctaattt gaaatgtaaa aatctataaa aaatttttcc ttgactgcta 19800 tgttttaaaa tatctgaaag ttacctgctt cctagtcctt ggaaggcctg aattattata 19860 aagtttattt ctttgcatct caatcattcc caaaagattt taaatatgaa tgcataagta 19920 gtgaataaac aaacctgaaa gtttgatacc tacagaacat ttttaaagct gcttcttttt 19980 ttccatgcag tattgttagt agagtataat actagttatg tcgtagagca gtcacttact 20040 catttattta cttcagcaag tggatattgg ggcctgctgt gtgttcagca cagagtgagt 20100 atagaaatga gggaagcagt ttctgctctc aagaagctca gcaatgcctt aggataaggg 20160 aagtgctggg ggcgggtgtg ctggaaagag agagggatgg gtggaagaaa attgaaaatc 20220 agacttcttt gaagaaaaga tacttaagct gggcacacta tggtagtagc agctttctag 20280 gtagaccgga ggaaaaaaaa gtgttcctaa gcagagaaat gggatccgta aaggcccatt 20340 ggagtgaaca tgcatagtat actcagggaa tcagttgatt gagtaaatat ttatggaggc 20400 cctactatgt gctgagccct gttccaggtg cagaagatac agcagtgaac gagacaaaaa 20460 ttcttaccaa ctggagcttt ctgatagaca gaatcagcca aattagtaag gaaaacatac 20520 agcatacaag aggagagaag tgctacggag aaaaataaag ctgagaaagg ggacagagac 20580 aaccacagtg gtatctggtt tgcagtttta atggtctgcc aaagtcttgc cagaagttga 20640 ggttgattga agacctgaag gtgtgctgga gtgaaccatg cacacacctg gagtaggatg 20700 ttccagacag agggaacaca agtgccaaag ccttgaggct gcagtagccg gggaagggaa 20760 gagcagctgt ctgcacagtg ggaggtaact ggacggtcag ctagggctag gcacaggcct 20820 catttatcag tgtgaggggt cagacttgac cctgtggtca ggagagtgcc atcatagatt 20880 tatagatacg gcttgtgctg tcatctctgg gacccccaaa gtgaggaccc agtgagttgg 20940 agctgagtgt ctgttaggta tagaggtaag atactacaca aaggaaactc agttttgtga 21000 aaaacgtgta aatatgtttt tcaaacatgt tcttaagact ttttaatctt tctatgttct 21060 tgttgtctgg tttatttccc atgaatgtta gggatcagat cactgctgtg taaataatag 21120 tccacccaag gtattcatat gcatttcttg acatggccta aggcaagagt cctgctttcc 21180 attgtcccct aatttatatt tgcttctttc tggagagttt taaaatgaat ttcaagggct 21240 tgatgccttt aaaatcagtg gatcaggccc tgtgttgcga gatttcttaa tgggccacag 21300 ttgtctctgt gattctcatt tccagtttat agcactactg tccacctaaa catacactaa 
21360 ccatgtggat cttaatgatg ttttaagaaa aaaggttttg acatacatgc cagacttcca 21420 cttaagatga aactgggcaa ctaggaaatg gtatgccgta catatgaagc actgtttatt 21480 gaatattggt ggcaaatttt gttttgttgc taggaccaga ttcttggagt gtgtagtagg 21540 aataaataag accaaacaca atcttgcttc actgaaaaaa aaaaaaaaat caaaggtttg 21600 ggtttttagg caagatattt agggaagata tttctttgta gaccagtatt agagttaatg 21660 gcaaggatca gtagtataat cgttggaatt tatgtcactc tttcatttat cttccacttg 21720 gtttattttg aagacttcag ataaattatt aaggtaaagt gcataggagt gctgaagttg 21780 atttgtaaaa gcattccttg caaaaatgat tgctagctga tttcacaaag ttgttatcct 21840 gaaaagattc cttattgcaa cttccagatc attacacctt ttaaaaacta ttatttgctt 21900 ttcaggagta ttgctgtata tgaaggtctt agcaaagaac ctctcgtttt cctcgatgat 21960 agattagttg agatcatgtt tatattggtg ggttttcagt tctgatggct ctgcagctct 22020 gttccttggc tgcaaggtgt catagagggg cctctgcttg tcctggtggt taaaggacca 22080 tcagcaggag tctccaggcc gtattactgg agacggacct caccagagtg agatgggaga 22140 attagtggtg ttcctgtcct tcttttccct gatgctcagg tcttccggat tgacattaac 22200 ttgaaatttt gcccagcctt tgactacctc actttcctgt ttcttcagct cctcatgtcc 22260 cctgctgggt gacagcctat atgggattta gacctaggtt gctcagcttc cctcctcagt 22320 aaaactgatg aatccatgga gagtgcttat tccagtgtct ggcacataga aagtgctcag 22380 tgagtgttag ctgttatcag aagattagtc tcagggggtg aaatgtgctt tattatctgt 22440 gctacctctt cacctagtat aatgaaaaat gtatatcctt cacagtgctt attttcagag 22500 caccatgtgt atttaattgt tagaaggtct agtcacattt cttttcactc tagtggttca 22560 cacttttgga tgtgtgcaat tcccatacct agtcttctgg tttactcttg gtgatttgac 22620 taacagattt taggttagag catccattaa tatttataaa atggtactgt gcattattca 22680 tgtcctcaag tcacattctg gcaagaagca ttaacgtgtg tcacattatc taattattgt 22740 cataatacac agaagtggca ggaagggaga ttaaaagttt attatttaag ttattattta 22800 tttatctaac atggttgagg catattatat tcattcactc ttttgacatg tcataaatat 22860 tgatttgtgg aagtgttgct tgatacagta ttgaggttaa aataaaactt gcgctaatat 22920 taatatttca caagtaggaa gactgagatc caaagatgtt tctctgttat ttctaaataa 22980 ttgactttct tcctctacca acaaacttac tttccagaaa 
aagtttgctg ctcaaagatt 23040 cacattgctt tgattaaatt catgtgctcg tcctctctat ccctatttat tccttctgca 23100 ggtatttatt ttgctaagct actatttgct aagccctatg atagttcagt gatgaagagt 23160 aaaaagtaga tcgttagttt caaagaaccc acttcgggta aaggagatta gatgaacaat 23220 ttcattgtgc tgctgtaagt gaagtaattt taatagatat gcagtagaag tcctacactg 23280 gaaatagtac ttaattcagt ccctatagag tggggaaacc taaaggaagt attaaccact 23340 gaggagaagt ttccagtgca gatgaagaga agtagctttc taggcagaga gaacagccat 23400 gaatgtcatg aaacaacatg ggacattcag gcaattcata gtcactaagt cttactggag 23460 gaagaggaag gcatggcaga gatgaggttg aagaagtttg ctgggatcaa gtctaaactg 23520 gctttgtgtt attccatgcc gggtactcaa atgactgcat gttggacttc attttgtaga 23580 ccattgcttt gcaactccat atccttaaga agtttggtta ggctgttctc ttaacactcc 23640 ttgctgggtg ttacttacat taaagcacct ctgtaggagg ataaaacact gtgaaccagg 23700 tatgtaagag acagctaatg acctggctta atccaaggat gtctttttag agtatcctga 23760 tgaagagatc aaataggaaa taattagaca tttctccaaa atactttttt cctcaaccaa 23820 gatttattaa tttttaagta agagtatgtt tgataactga aaattgatat cctttgaagc 23880 tggataaaca tgcagttaca ttttttactg gacattcaag ttcagagaca gctgcgttgc 23940 ttaatcatgg catgtgtctg aatagaaggc tgatctagca cagcacaaac atggaacagg 24000 ggagaagacc tgagattttc cttggaaact attggatttt actctcaaag cagagggaat 24060 ataagagagt taaggtctgt gccacaaagt gcagctagga aaacatgtaa atgggcatta 24120 aatacctacc ttggtgattt aaaaaatttc ggaaatggag ccattcagtg tctaactttg 24180 agtaacttcc tggtaaaacc tcaaaagaga aagtgattga aaaaatagac atcagagtat 24240 ctggatgagt taacatgctg tgtgagacta ctcattaaaa gccatttata agatgtctta 24300 ataatctttc tcaagaacca gttttttaat tttgttaact gtttttgttg cttttacttt 24360 ctccatagat ttattatttc catcctcctt gattctttgg acttactctg ttgctctttt 24420 ttcaagttgg atgcttagtt cattcttttt ttttttttca gcttgtctta tgccctgtga 24480 aatttccctt taaacactgt cttaactgtg tccccatcaa tgttgacata tggtacttcc 24540 actgttcttc atttttaagt atttggtaag ttccctataa acccctcaca ttaaaaaatc 24600 tttttattat agaaacttcc aaatatatac agaagtagag acagttgtgt aatgacagag 24660 tgcttcagca gttgtcagtt 
catagccaat cctgttttat taaacgagat ttactgagac 24720 ctcagctagt aaatcctcta agaatggaga tactggaaag gtcctggaaa gtgatattta 24780 ttacattaaa agacctttgg cttgccttac tctacatagt gactttgttt tcaaaacaaa 24840 gtctaattgg tctgataatg gaacattatt ttaaatttaa attatgactg ccataggcat 24900 atgattctct ggcactgtcc ccaataaatt gttagcattg tacaatttca ataaagatgc 24960 tagaatcaaa cacaaggata aagtctaaag gggcctcatg gattatccat tctatgtcct 25020 ttttgtaatt aaaaaattgg agcttagaga gctaaatgcc ttgtttgggg tcctgccgtg 25080 aatgagtgac agagctaagg ttgtggtctc ctgggccaag gtcctctcca ttacaccatc 25140 cccctctggg ctgtttcttc gtatgcagtt taagatggcc cctaggttag gcagaactgt 25200 gaagtaggag agttgtgtcc cgtagcccca atttctcctg tccttatcat ttcttgtttc 25260 atgaaacatg aaacatgaaa ctggtgcttc tctgttagtt gtattgctga ggtgtatgat 25320 gaaaagtaca gcaatgggac atatacttaa taacaaatat tttgctgtaa gttgatttat 25380 tatattggat cagtactgct tgtttttcag tgatttcctc acttggacat aatacaaatc 25440 agtgttcttg ttgtttgtta tacatttaga catttacatc agtttaaaag ttagtcttca 25500 acaaatattt gtgtccacac tttagtattc cattgtttga aagattcagg ctttgtccac 25560 tctggtttat agttacatat gcttaaaatt ttttcacttt gtggaaatgt ctgtaaaata 25620 catttttgta acatgaatca tttctactga taaccataaa tttgcccttc ctgctttgca 25680 ctgattgcgg cacaaggaag agaaagagaa ggaagcacca gctttgggaa cataggcctt 25740 aagcttgaag cagagagaat gaaagcagac tgtattagat tcctcctcga gcagaacgga 25800 atgcctgata acagccagcc aggagctggg ggagttaaga gcagcgattt tacattacac 25860 tggagttttt aggctggcat gaaaagagat gccagtatta gctatttgca tcttcctgta 25920 gccacaggga tatgtgaaca gctggagaat taatttgata tatattccca ggagcttctc 25980 ttctgccaac agaattccat tactgtgagc ttctgcgtgt gtactgtgaa tttgtttctt 26040 gctttttaat atctgctata gcatatttct ggtaagtgga ttgccatata ataggatgca 26100 gccatctctt ggtttggagt aaactctatt aacctaaaca gatacaaggt ggaaaaggta 26160 agaatctgag catacaaatg ccttccacct tcaggctggc agaaatttca tgtgaacaca 26220 cagtactttg tccagtacgt gcactcatac ctgaaatgtt ttgtttcatg ttagtggaat 26280 ttcacacacc agcaagcaga tggaaagata tgttgaagag catagaggtt gaatccagga 26340 
ccactcaggc tagtgttgcc acagctcgtt cctgtagctc cttagcacag ggggctatat 26400 ctgctggagc tgcttcttca catatctggt gttcgttatg cctccacaca tcttgtaaga 26460 ggcctcatgc ttcatatcct tctcagtgcc cactttcctg ccctcaggat ggaaacttct 26520 ctccaactag cctttgcatt gaaaatcttt catgtcaaat gtagaaggaa agaatttcag 26580 gcatcaccga gaatattgaa agatattctt agaggagaaa gactaagtcc aggacaagac 26640 tgaatgtagc tattggaagt acaggtgttt cctgccttgg caatataagt tgttcactct 26700 tataaagcaa ataaaaaaac ctactcatat tctgcaaaat cattcattac actaaaaata 26760 agagagctgt tgggaaaaaa tagggttggg gaaggtcact caagacctat gcatctgtgt 26820 gtatagagga taaatgcaaa caataagaac tcaaataaaa atactggccc aggtgagctg 26880 tccctggcat ttatttgcac ctggatctct ggtagcccat gctttgcagc cctccacact 26940 ccgaagctca ttctgcctcc tttcagggca cggggtgttg aagatactta cttttcatgt 27000 ggaacggtgt taccagaatt aaacataaca tccagtagtt tcttcatcat tgtttttatg 27060 ggggtttttt gtttgtttgt ttgttttgtt ttgtttttag tacaggggca agtgtctcca 27120 gccttctcat ttctgcttgc agtttcccca gatagtttaa tacagaggac attttaacca 27180 cgagagcagt catttcttgc actaaagaaa ggaatttaag caaaattttc agccttttct 27240 tcaggtcttc catatactgg taagacttaa ctgtgagaaa acttgctcct gatacttatt 27300 tttatttatt tatttttttg acagagtctt gctctgtcgc ccaggctcaa gtgcagtggt 27360 gcgattttgg ctcactgcaa cctccacctc ccaggttcaa gcaattctca tgcctcagcc 27420 tcccaattag ctgggattac aggcatgcgc caccataccc ggctaatttt tgtgtttttt 27480 gtagagacag ggtttcgtca tgttgcccag gctggtcttg aactcctggg gtgaagtcat 27540 ctgcctttct tagtctccca aagtgctggg attacaggtg tgagccattg cgcctggcca 27600 acttttgatt tttttttttt tttttttttt tttttttgag ttggagtctt gctctgtcac 27660 ccaggctgga gtgcagtgac atgatctggg ctcactgcaa gctccgcctc tgggttcatg 27720 acattctcct gcctcagcct cccgagaagc tgggactaca ggcgcccgcc accacgccag 27780 ctaatttttt tgtattttta gtggagatgg ggtttcaccg tgttagccag gatggtctcg 27840 atctcctgac ctcgtgatcc gcccatctcg gcttcccaaa gtgctgagat tacaagcgtg 27900 agccaccgcg cccggcccac ttttgatttt tttagaacat taatgtgctg ggaaaaatca 27960 catgagaacc agcagaaccg ctgtattata ctgacaccat tttcctattt 
accagttaca 28020 tattggctaa atcacattaa agaaacattc atttttatgg ttccacggta actgttagaa 28080 gaggtaccca gcattctttt gttattcctt ggcctaggtg tacctgcagg acatgctctt 28140 tagaacagca gttctcaaag tgtgggctac agaccgctta gaagcctgaa gccctttttg 28200 ggtggtaaat caggtcaaaa ctaattccat aatcctatta acattttatt tgcttttttt 28260 tctttttcca ctttgcacta atggtgcaaa aaaaaaaaaa aatggtacat aaaactcctg 28320 atactttagt atgaatagag gttgtggcac caaattgtgc tagtaagtta ctgtattcac 28380 cgtgtacttt cagtaaagca aaatggtaag gaattaaaat aataaaataa aaattagttt 28440 ccttgaatgc ccttgatcga agcaatacaa attattaagt ttattacatt tcagccctca 28500 attatacatc ttattaatgt tctgtgggat gaaatgggaa atacacataa agtaattctg 28560 ctgcatactg aagtacagag cttgtcatga gaaaaagtgt tatatttcaa gatgtcttct 28620 gaacatccaa gtggaagtgt tggtttagca gtttaatata cctggcactt cactgtagta 28680 ctcaatattg gacatatttg aacatatatg aatttgaata tatttatgaa attaggtttc 28740 tctcttttat acttagagaa agtaacttat gtaatatcct aatttggtat cttgcagaaa 28800 ttataaactt aaaaatacat ttgcagaggt acctcagttg tcttcacctt tcaggctagg 28860 agactacatc ttgtcacatt tagggtacca ggaaaaaacc tacttcatat agttaaaaac 28920 cagtacctct gtttttaact tttttttttt cctgaccagt ccaaggagaa accaggagct 28980 atgttagaga atcagaacct tgattcttgt gctgtgtatg tggatttact taggaaaaga 29040 gagattttaa aacataattt gccaactgct gttagcagaa ctcttgggct cagatttgat 29100 ttccataatc caacctcatt acatttttgc tttcagtcct tgttgatttt gttaaattat 29160 tttaaaatat agtgttctgt aattgaatac cagaaaagca tttgtctcag tagcatgttg 29220 tacttgatcc tatgaagcgg tggttgctag ggaaaccatt tcccaagaag aaaaacaaag 29280 tattgattag gagcgggcag agaattgaaa gagtttactc tcaaaaaaaa gaaaataaaa 29340 gtgtctacga tgttgttagc tttagagtca gtgatgcttc ctggctgaac atagtggcag 29400 caatgtttct gaggggaatg gtcaccttta cctgagcatc ttcaggtgag tctgctgctg 29460 ctgccaggaa acttctgctg agaaagcagc agaaaacagc aaggggccag cggcctcatc 29520 aaaggccttt tagctgtgat tccagatttg agggaaatgg tcttcctgtg gggaggcaga 29580 gaaaagcatc tggcagtgaa aaccaattag ctgtgagatg ggctgcttgg aaggactgtt 29640 gatgccagcc aggtgataag aagttgaagt 
cttagagctg aacgggggga agagagtaat 29700 gatagatgat gcctagtgag aaatgtgccc agtattccaa gcatttcttt gaaatgtgtg 29760 cctcatacag tgctgtaatg ataagctttt aaaataggat ttgtattgtt taaacatgtt 29820 ctttatatta tcttgaaaca tagttttcct tgtgatgttt tgtatatatg tgtgtgtata 29880 tatgtataca cacacacatt ttttaattag acactttttt tgttaaagac acaactgcat 29940 tgaccatacc agtgacatat atgtgtgtct atatacacat atatgtataa tacatatata 30000 tacatgtgtg tatatatgca tatgtgaggc ttctttgtct accggggaat tatggcttac 30060 ataaaaccac tatggtaggg gacaacaggt ctgtgcagat gtttgcggga gcatagagga 30120 tggagcagct tattctcctg gggcaggtag agaaggcttc tcaaaagagg taacatctga 30180 gctaggtttt gaaggatgag caggagatag cagaggaaga agggaaggag gtattccagg 30240 tggacgacac tcccgagtaa agcatgggag cagttcctgc gtgtttctgg taagagcctc 30300 aagtgtgcct agagcattag gatgtgagag cgactggtga aaaatgagtc tgcagaagta 30360 gctacaagcc atgccatgaa gaagctgata ttcagcagta agaagctggg acattttctt 30420 attgccaaga gagaattact gggcaggttt gttttttgag gaagatggat ctggggacat 30480 tgtggggaat gaggtggaga gacaagagaa tagtggcagg gagccaggtt agaagacatt 30540 gtggtaattc aggagagtgg tgtcttcagg gagttccagt tagaatggtg ggaagggatg 30600 tttactgtgg aaaaaattct gcaataggat caactcaata actgcacata tttataagaa 30660 cagtaataaa gatttcctct taaaactgtg aaggaaagga ggacaaatat acatggcttc 30720 cagtagttca aatgttactt gaatttctta caagtttttt ttaattccta gggctaaact 30780 gcctttcagt tataaatttg catatgtagt cacaaaaatt ccttcatggc aaatctgact 30840 gaactgtagt aaaatgctag tactactggc atgttaacag ttaaacagat aagaaatcac 30900 ttggatgcac attttgttat aactgaagac aaatatcacc ttcttttcta tgttgaattt 30960 tttcaatatg gtgtttttct tctttttctg attttccttt cagagtactt tactaatttc 31020 attgttgtct tgaagatagg aatgtacctt tatgtgtcta tgttatattt gtgtgtgtgt 31080 gtatgttgct tctctctgtc tatatatata aaacatatat atatatctta tatatgtatg 31140 ttgtttctca gaaccaaatt taggctaaat aggagaacag attttaaata aaacataaat 31200 tgtgtgtcgg tataaacttc aaatctgaaa gataatcatt ttactaaaag ctgaggcttt 31260 gatcaaataa gtgagcctga tctcctgaaa tctaaaacaa gagattttct attctccctt 31320 ttcaaaggac 
cgccctgaag ctctttatgt agtagatgac attcttgtaa cgacctcttg 31380 atgagaaaat tatttatttc agtcaggaag agcagacttt tacttcactt acgtactgag 31440 caatgatgcc agcaataaaa gcttacccta taaaaataag agccatttgt ttgtactgtc 31500 tttttttttt tttttttttt ttgaggcgga gtttcacttt gttgcccagg ctggagtgca 31560 gtggcatgat cttggctgac gacaacctct gcttctctgt ttcaaacaat tatccccctt 31620 cagcctcctg agtaactggg accacaggca tgcgccacca tgtccagcta atttttgtat 31680 ttttaatgga gatggggttt caccatgttg gccaggctga tggccaggtg atctgcctgt 31740 cttggcctcc caaaacgctg ggattacagg catgagccac cactcccagc caaaaacaga 31800 tttttaagat ggcatttata cctgccttag tttgagcaag aggtttaagg tttcttaaat 31860 tcaggtactt taaaaaattc tcttcagata tgcttatcag aaacatgttt actatgtaac 31920 agtcagttcc catttcaggt gggtaaagta tctgttaaag ataatgttaa gtgtgaaaaa 31980 accccatata atggcccaag gattcacaga gagaaatagt ctaatttcct atgaggaata 32040 ttatgaagaa ttgagcctgc atatgattct gtcatattca ttgcattttt tggagctgag 32100 tcccatgata gctcttttta aaatcgacct tatttattgt gacctgctcc attccatgat 32160 cagaacctga tttggtctca gcctttgtat tcagttacct cagcctgtca ctaagctgca 32220 ccactttagt ttttttgttg tttctggctt tgggttggca gtgatggata ctatcagtct 32280 ccccaaacgc attagagggg atcccctcca ttgtaactct tttggtggtg tctgattttt 32340 gatgttgatt attagctgcc ttaatgtagt aagttgaaga gctgttttaa acctgaagtt 32400 ggtctctatc aatagagtaa aaaattagct ttttaaagag agtctcaccc tgtttcccag 32460 accggaatgc agtggcgtga tcaacctcgt gggctcaagt gatcctccca cctcaaactc 32520 ctgagtgctg ggaccacagg catgcacaac cattcccagc taattttttg ttttgttttt 32580 gtagagactg ggtctcactg tgttgcccag gctggtcatg aactcctggg ctcaagtaat 32640 ccccgtgcct tggtctctga aagtgttggg attacaggca tgagccactg tgcctggcca 32700 aaaaagagct ctttaaaaaa taattttgta gattgacaaa tgtgactctt gtaattttat 32760 tgaacatgaa aaaacccagg aatctttatt tgatattaaa catttttaaa ggcatctcag 32820 ttgttgttgt aataacacat taagagaagt agtggttttt tatttccaac ctttgtgcat 32880 atagctattt aatgcctaca tggatggcta ttatttcact tttttcagtt attatgaaga 32940 gattgggttt cattcatttg taaagtttca gccagactgc ctttcacaaa ttgatttgtc 
33000 aaaattgaat gttaatcttg acatcccagt gcgtttttgc ccgcgaacag gcctttgaat 33060 gaagctgcaa acacacatta tctggttgtt aattgtttta cagatgagaa ctggactgat 33120 gaccaactgc ttggttttaa accatgcaat gaaaacctta ttgctggctg caatataatc 33180 aatgggaaat gtgaatgtaa caccattcga acctgcagca atccctttga gtttccaagt 33240 caggatatgt gcctttcagc tttaaagaga attgaaggta agcattaatt tttgttaacc 33300 atccagtcgt aagccttagc attatgtaac cctgagtagt aatagtattt gagcttgaaa 33360 aggcaaagac aagtttacag agagtggtag tttgtataat cccaactaag atgcataaag 33420 tttaccaggt gcttatgtgg tgggtgtgtt tccgagacgc tatggaatgg cggtgctgtg 33480 gtggtgtgag cagggccctc tgtgccagcc ctcctaggcc acctctgcat aatggatgat 33540 ttcacaagcg gtgtgcttga gatcagcatc tgttgtcacc aggtccattc tgtttgattt 33600 accgtaggat ggtgctccat gaggagtctg ctaattagaa atgcggctca tctggcatga 33660 aagagttatc tctttagcag aaatgggaga attatgcagt gatgattgaa ggcttaggtt 33720 ggttttgctg gagggcagga atcacatagg gcgtggaggg atcattaggc tcttcacctc 33780 tggagataag tcagaaattg cattcaagta acaagaaaat tatactgata tggctcataa 33840 cttttgtaca ttcatacata caaccaagaa aaaacttcaa gaaattgaat atatgactca 33900 tcaccacagt aatcttgagt gaagtaccca gagcttggtg ccatggctca tagccaaaag 33960 gggtaaagcc tgggtttggg tctcagctct gccgctaacc agcattaaac tctcgtagcc 34020 ccagtttctt cttgagggaa aaaagtactg aaagaggcat tcttaaatat acctcttatc 34080 tctaaaatga tgtcgttgta agatggagtg attggtaata taataccaag gattcttaaa 34140 gtccaaaaat caagcttata agggcagtaa agcaggactg aagttttgga aatgtactta 34200 gtttccaagt tatattaaat ggtaaggaag aagaaatttg cagtagccta caaaataaca 34260 tttctctttg tatttaccag tgctcctgaa tcattagaaa agaatagagt agaaaacaat 34320 agaaataaca tttgcgaatt gcagaggatc ctagagaaca tttagtccag tccttgaact 34380 tgaacactgg gaaactgagt ttccacgcat ttgatgagta tcagtgctag tactaatcac 34440 accaagtatt cttcctgtca catcatgact taaacaatgc catgggttac ctaactcatt 34500 tctgttgttc ggttgccctt tgctccccac acccattgcc tgccacaaaa gggtattttg 34560 ccaacttcag tatgtaaaca tttagggata tagatgacgt gatcacatgt ctgtttacat 34620 agttaataat ttacgcatac acatttaaac ttactcatat 
aaagtcatac acaatttaga 34680 atcattactt tttgctattc ttacatttca ggaagctgtc ctaaatggtg agtgcgtcac 34740 cattgctgtc attgtctcta ctcaggcccc cttccaggtc atttaatccc ctgctcctga 34800 acatgccctc ttcactgcca cctaagttgt ttgagatgca gcagattttg aatctaagga 34860 tgatgctatc ccaggaagtt ttgtcccaaa acccagtgtc tcgtcacata aaattaggac 34920 aattgcattt gtaattagtt aggcgtgtat ttatgatctt tgcaatgtgg agattgacat 34980 gtggagatta actttattga tttttaaagt gattctaaaa aggctgtttg tgagatccag 35040 catttccaat tttgagatca ttcccacagg aataattacc gtatctgatt gaatttcttt 35100 acttgttatt ttccaactta ttatttcaga tggcttctat gtaatccaaa tgtggaatta 35160 gttgaggtca tttaaggtta atgtatcttc tacaaacagt gctttgttat tcaaaataaa 35220 gtaccaggca gtatgtgaga tcttgtaaga agtcaggatt aggggatttc ccttttctga 35280 gaaggaattt tgaaggaaaa agcatactta tattcaggta tatattgtat atgaaagaat 35340 tagtgatgaa atcatactta ataaatagtc acgtcgttaa tagtgacaga gacttggaat 35400 gctataggac ctcaatgcca agaaagtatg atttggaatt gtgtgataag gtatcatgga 35460 gggtagactg aacgatttag aatagtgatg atgatagatt tcagaagtga ggccaacatg 35520 gacgaaagaa gaaaatgtgg aagtgggaat ctgcatgact tacatggaat acacttggga 35580 aactggtctg aaagagcaga gttttatgtt ggtggacaat aggaaataaa gattggaggg 35640 aggataggac cagattatga aatgtcttaa acattgtgca acaagcttaa gccgggtaat 35700 atgagcaatc agaaatcatt ataggtttat gaacagacag aatcagaaag aagatggtga 35760 agtcaacatt aggcagactt tggttgtggt ggaatgcaaa ctggattagc agagggagat 35820 aatggcagtc aagaggctcg ttggagagct tttcagctgt ttagatgtgt taccacaaga 35880 aactggacta gagtggcagc aggaaaacag caaagaagag atggatatgc accattttga 35940 tggaagaacc tacggtatct gctagcagga aagggaagga tgaaaattac acaaagatga 36000 ttctagggtt tctaacctga aatagaatga tagtgcaagt aatagaagta ggaagacttt 36060 tgaggagtct cgtgataaaa tggtactgaa aataggggta attattttag ggcaatgtag 36120 aggtcaagaa gagcttaaga aaatatagga gatactacag catgtttgga tcatgaccgg 36180 aatgatttag taagaaggaa aagccaataa tgtaagaaag gcgattgcag gagcaaagac 36240 tttaaggaat aaaaaggaca aaattgtttg tttctcaggg aagtataaaa gaaggtcatc 36300 atctgacatt gagatggaag 
aaagggtgtg gaaaatcaaa gagaaagagg aaaatgtaaa 36360 cagacatctt taaaggagag gacaaatgaa ttaactaggg aaatgtagta gggtcaccta 36420 ctggtgctga cagcttactt gaggttagtg gttatgcctt tgaaaagagg ctggccagca 36480 cagttgcgtg attttctcta gttcttatgt aggtgtgcag tttaggtaga gaggtagccg 36540 gttgggtatt ttcttgacca ttctggtaga aagagagaga gggagtcatg gggattgatg 36600 attatataag agaactttat ggtgattgac catttaatct acactgagaa ggagacaaaa 36660 tcagggagtc agtgatatgg agagtagagg ggaggttcta gtgggattga agaattaatg 36720 gagtgagata gctaaggaag caagtagaaa tgaatacttg aagagaaagt tattattggt 36780 tgtaatgagg gttagaaatt actatggatg atgtagttga ttgaagaggg gtgaagaaca 36840 acatcactgg gggtgggagg tcagataggt gagtcaggat ggagtgcctc taccaatgat 36900 gaagatgcca aggaatagta gcagggaaga gacatggagc cagaggttct gcaggtaaat 36960 gccacacctc cgtgtggctg gtgataacct aattgcttat gcttcaagtg gagaaagttt 37020 tattttttaa aggggaaaag gaagaatgat ctttaagtag cagtgagaac aaagaggata 37080 cctatgcccc ttccaggatt tatggaatgt gggagataaa gccattccca tttgagaaga 37140 cttcaaggga aaaagagtaa tcaggtacta cctaggtttc aatttacaca agaaggtgaa 37200 gggaacatcc agatggaaga atgagggcag aaagaatttt gttgataagc tctagaaggc 37260 acagaatggg atttgggagg gaggcttgta cagggcatgg gtgattttgc aaagccagga 37320 ggggttaaga gtgcagtgat gatggatgac ctggcgaatt tagacttgct ctcgaatctc 37380 ttacaggaag gtgaatagtc tagttaaagc ctccttttag ggactcttct ttgagtagtt 37440 tatagtatta agggtgtgca gtggtgggga tagggcaagc agaatggtat tcttggcaga 37500 gcataatgag aaaacatggg tctccatttc tgtgtttttg gtggtgatgc tgaaaatgaa 37560 ttcgaaatgt tgctataaca ttttttaaaa cgtgtggaag tcattgcagg ccatttcaat 37620 gttatagact tatggagaat actggaaatt tctgggcatt gaaaaagttc ctgtgtatta 37680 cttatgtttg tttaaaagga gttggtcctc aattttactc agtgttttct tttctaaaat 37740 acttgaacat atcaccataa aatcagtttg ttaaaaatat aaaggaaata ggatactgga 37800 taacagtcag ccgtggctta gtgcagaata gtttgtcaga gtgatctgta ttttctaagg 37860 aagaagtctg ataaagggaa tgacctgcaa gttatcccac cttgatctta acaaagttta 37920 tgcttccgtg ccttactact tattcatcag tgatttggga tatgttagac tgagcaccac 37980 
aaaagttaaa ttagtgcacc atgggcacat cagtttgatg cagtagtaaa agcagtgcca 38040 gcctctatag cgccagcctc tgcagctgcc acaggtgtgt tctaggggtc tctctgggct 38100 cagtggtcat aagtaatgtc aataccaaag gtttccatga tgcagtgaat aagttgaact 38160 tttaagtttg cctataattc caagttgggt acatggcttt taaaaaaatt attgtaattc 38220 agtggtcata attcaagaat taatcagaaa tgtgaaacca tgccacattt ggtgaagaat 38280 ataaaacacc ctggaaatca ttcatctgtt tatttcatcc agcaagtatt tatgaaatgc 38340 cagatacata cgtgacactg ttgtagttgt tgaggataca gctctcatgg agcttaactc 38400 caaaggggga cttggactga agtcctgcca ccggaactct agcatggaga cactttcagc 38460 tgtagcaaga aaccttctgc catcccctaa gtacagggaa gacagaggct cccagctctg 38520 tgcccaccca catttttata cagggctgaa ggtccatggg gagcctgggt gtcatgcacc 38580 aagggtgtct cctaggtttc tcaaacttga gagagcctga tggtcacaca ggcagtaaat 38640 acataaataa acaagaaaat agcatatggt atgataaaag ctttggaaaa cagtaaccag 38700 tatgctgtga tagagaatta cttggaaaat tgctgtattc agggacctca gggacatatt 38760 ctatgaagga agggcagtcc aaactgaaat ctcaaagccc aaaggagttt gccttatgat 38820 gacctgagga gagagaattc catatggacg gaacagcaac tgcaaaagct ctaaggagag 38880 aacaagcttg gtgtgttgaa ggactagcaa gaagaaacag cgtgaccaga atggggggga 38940 gggaggggta agggccagac cacctgagcc cttgtagaag atgatgagat gtatggattt 39000 tgtgatgagt acacaggaaa gccattaaag gattgtgagc aaaggtttga catgatttgt 39060 cttttaaaag atgaatatgg tggaaaaaga gactggttgg ggtagggaga tgtaagagta 39120 tgaacagagc tggctggtag gggttctttt agtagttcat gtaagagata atggggactc 39180 gtacctgaga gctggagatg aagagaatta ggtgcattcg agataccatt ggaaatagaa 39240 ttagcaggag ttgttaatat ggatgttgag ggaaagagaa atcaaagagt actcttcgat 39300 gtggagttta agcaaattgt agtgcccttt actagaggga ggaccagatt taggaaggac 39360 ccaggagttt tgggcaaatt aaatttgaca tccacataga actatttgtc acaagcaatt 39420 tttaaatgta attcagtaac tcattaagta gaattcatac atacatgctc tatggtgtat 39480 aattacaaag agatatccct gagaaaagtc aaatctctcc agaaaaagtt ataacctcat 39540 caggagtttg tatagaaaaa taagaataac ctactttgca gtgatttatc accaggcaga 39600 aactggctga agaggtgtct gtgttatcca taaaatgtgt tttcatcatt 
gctgcatcac 39660 acatcacatg ggttagattg gtgtagaata tgtggaacct agtgctgctt gcacagtgaa 39720 cattttgcaa aaggaggaag acacttaggg ctttagataa ttctcctctg ttttagctga 39780 aaacgtttgg agaacgttga atggtttatg atttttggct gttttgaatt atttcctcta 39840 tatagagaaa aaatagacaa tcaaaaattg gcgtgtgata ctatacatca ttgttttata 39900 ttgagtggac aaataagaac cgtgttgcat tcataagatg gataaatcaa ggaaggtttt 39960 gtggaggaac taggaattgg gtccaactct gagaaatttg ttgattggct cagcagaagg 40020 gaaggagagt ttaagagcag ccttggatag aaaagagagg acggacagag aggtacagat 40080 aggaagatac taaggaggcc aaatcagtgg gcttcagtgg ctgattgact ttcggggatg 40140 aagaaaggga agacttccag attgtgggag gactattatc aagataaatg ctggagagga 40200 ggtggtcagg atagattttg ggaggatgat aggatatatt gtggtgtcat ttattagaat 40260 gtgaaataca aggagagcag ctttgtggga ggttggatga ggtaggtact ctgtttggtt 40320 atgtttgagg tatctatgga gaatcaaggg gaggatgaga gttacaggtc tggaactcag 40380 agagagatct gagctggaga taagaattta tgagtcaccc tctgagggag actgtggtgt 40440 ctaatgactt tttgagtaca taaatgatgt gaggagtatg agcattttta ggaagattaa 40500 tgtgacagta atgtgcataa tggaggaaaa aagtgactgg caatgttaac acaccaagag 40560 tgataccaag tgtatcttat acaaaaggac cccgtgcagg gaagctggca caggaagtga 40620 ctcctctgtt agcatggtag aagaaaatcc aagggttgct actgttagcc aagtgaaagt 40680 ctaaaacaca attgccagtg cccatgatat tgagatgtat aaaatggaat acattatttt 40740 aattaattgt taattgattg gttgactaat ttatactccc ttgttcaaaa aagcctttaa 40800 aagtatctga ctatagtaga gctggtcaga cccacacctg gaattagggc tagttcagag 40860 tagaagcact gaaatgatta agtaattgga tagagaggtc aagaaatagg attatagtca 40920 tagactcttt cagtccagaa ggtactcagc attctctaca agtaccttcc gaggcgaccg 40980 ctttcagttt agagctgctt taattgttga ttcattcatg catttattca tgtattcatt 41040 caaaaagaat gtattgagca catgccatgt gccactacac ctcagattca ctagggcaaa 41100 tttagtttgc ttttacctga ggactatttt ctaaatgacc taaattgcaa cacattttca 41160 atatactatg ggagcttgct ttgttttttt gtttttatat tttgatttta cattgtatgg 41220 ctatgttctc tgaaatgaaa ctaattttta cacaaagact tctgttttat agcctttttt 41280 cttttcattt ttaaaagcgc agataacagt 
tgcctcagat cattaactta attatcctgt 41340 actctaaatt tactatttca gtgctaccta aggaaatggt aatatcatga gttttttatt 41400 ccatttttag ttttttacat ttcacttatt tcacttcctt tttttcttta cacaagtaaa 41460 gcgtacgtgc taaattttat tctcgattgt tacctcgtcc ttggtctaga ggctgtagta 41520 caaccctgtg ctgtgtgtgg cctaataaaa tgttgtaaat cttatgtcat ctgtaatgtt 41580 tgtcagtaaa caaatcttta gtatttagaa agaaatatta tgccattttt tatttatata 41640 actttttttc tcaaaagcat tttgggtatt ttttatgtgc gtgggagaaa atgctcacag 41700 aaaacaccca agtaaatgtg atctggtgga gctgggagta ttagcattca tgccgccttt 41760 gaaatagtgc aacagaattt cctcttaatg ttcctgccac tagaaccgct tcatggagac 41820 acttttcagc tgtagcaaga aacctgctgc cattgcataa gtacagggaa gacagaggct 41880 cccagctctg cccctacccg ctgtcccata cggagctgaa ggtccatggg gagcctgggt 41940 atcatgcact gagggtgtct cctaagtttc tcaaacttga gagagcttga tggtcacaca 42000 gagagctggc ttaactagga ctagactagg gctagggaaa tctgcatttc cagcaagccc 42060 caggagaagc tgctgctgcc cattttcacc acagtttgag tagaaaagtt atagagtgtt 42120 tatgtgaatc gacttcgctg cacagagaaa tggatttgtt gtgctggtgt cagagaacat 42180 catatcttca agtagtcact ggacctaatt ttgatttcca ctacttcttt cattgttttt 42240 ttaaaaaagc atatcttata cattgctttg cttcaaatgt taatacatgg taatgatttc 42300 attatgcaat taaaaaattt ttttctcaag taaatgttaa attaaaaatg aaatatttca 42360 tattatacag gacctcatgg taagggacaa aacaaaccct gggactcgga tgctggaaaa 42420 atcatttgct tttaatagag ttgatccttt tcataggttt acccatttgg tctttgctaa 42480 actttacaaa tatgtctaaa actttatgtc tatggatttt ttttggcaat tatattgatc 42540 tcctgtatgg cattaaatgc cattatttca aaatatttcc taatatgtta ttgactgtat 42600 tagaaataat cattaattca caaaatcaga tatactgaaa attaaattat atgttctcca 42660 aaaatataat tgctgtttac tatgagagta ttatatagca gagttaaact aataagacag 42720 tcgttacaaa ttatcaccac tgtaattgtt ccattggagg gaggatggaa atgggtgggt 42780 tttgtacatt ttataaggtt tatgaagttt aagggaaagg attttgggga aagctcttgg 42840 cctttccaaa ctgctgttaa tatactcatt gatagtggaa tatatgacct ttgatggcac 42900 ttgggaagct aaaggaaaaa tcagctggga ccttgatgat aacgaggtcc ctgacccacc 42960 cggaaggtga 
ggttaaatgt gttttccttt ggtcaaggga agggaattcc agatcctttg 43020 ctcatgtcag gaaaaacctt tgtgggggcc aggacttagt gttttaaact tgataagtct 43080 tactagttta actctgctaa tatttgaccc ctcccccccc cccaaaaaaa aacaaaaaaa 43140 gataacctct caaactgatt ctcttactcg tcctgaaacc aagcattctg tattgcaagg 43200 gccatgccct ctttgcttca ggactttcat gtgtacagat tcctaaattt taagttacac 43260 agctgcttca tcaaacttac tcatcataga ttctagtttt cttaggctag ttacatctga 43320 aagtggtatt atgcttgtga atgtgttgct ttagctcata taatttgttt aaaaagttta 43380 ttttagagga atccagttaa cactgcttat caagtttaga aacactaagt cctggccccc 43440 tcgaaggttt ttcctgacat gagcagagga tctggaattc ccttcccttg accaaaggaa 43500 aacacgttta acctcacctt ctgcgtgggt cagggatctc attatcatca aggtcccagc 43560 tgatttttcc tttagcttcc caagtgccat caaaggtact tgtttataag gcagtaagac 43620 cattatgcaa ctttaaattt tatcccactc tgtttgtaaa gacaatcaca aattgacaat 43680 attctttgtc ttttcttttt tggagcatat atacagaagc acacacacat atacatattc 43740 gtgtgtatgt atatgtatat ctgtaaacac agatatgtaa agtgacttta aaatacaagt 43800 acatgtgttt cattttaaat tctagcgatg tcattgggaa agaaagtgta gacaaagtta 43860 catatttttg taagtagtgt acttttattt acgttgacag ttaatattta aaaatttttt 43920 gttacagata caaattttgg ggagaattgt ccttagaaac atgttttaaa tcgttgttct 43980 gatgtcttga taactggtac caggaaaata aaagggttac caaaaggcag aaaccagttt 44040 caaattatct gggaaataaa aacaggaaac tcagggaact tggcctgaga ccagttggag 44100 aggagcagtt gttggttgaa taattatacc tgcattttaa ctgttggaag tcatacagcg 44160 accttctggt tgttgtgatt tcttgtgccc gtcaaaaaaa aaaaaaaaaa aggaagtcac 44220 gattaaatat cttatgtcca gttaccaaca ggagtgtgag tgtaaaggag cctctttatt 44280 cctacacatt aggttacaaa aaggattggt ataaaaaaca aattttccag ttgacattta 44340 gagtgatact ttctatacag tcatctgtaa acagcataat aatgttctat accattcatt 44400 catggatttt ttttcacctt cttctttgct ttaactctta actctgagct ttgaagccat 44460 ctcttctgtc caaagatgtg cctagttcaa gcacttttct gtcctctttc catacctaat 44520 cccaggttgg ttaaaggagc accagaagtt gtaggtgagc aggttttgct acatctgggc 44580 tttatttcag gattaaatag aatgtactta ctatgtgttt ttaaaagaga cttattgtta 
44640 atattgcaag gacaaaaaca acctgacagg ttgagccaga taaatggtgt tcttgagggc 44700 cgtgcctaaa acttcaccaa cccaggatgc ctaaaaaatg ttggtctaat gactgacctt 44760 agcagggtga catgttcact gctggggtag ccctgagcac tcctgcgcgt gtttgtggtt 44820 tctgacttgg ctgtaggtga cctctcagtt tttcccaggg ctcggcgtgt gctctgtaac 44880 acagagctgc tgagttgctg ctttgcccag ctgtgtccgg ctaagccctg gcccccacaa 44940 aggtttttcc tgacatgaga aaggatctgg aatcccagcc cgggaggcct tctgagctgc 45000 tggcaaagga tgctgtgagt ggagaccact ccccatggca gagctccctt ccctctaagg 45060 ggagctggag ctgtcccttc tgtgtccaca ggggttctcc gcctcctgca gagcggagct 45120 gcagccccag gcttgtctcc atcaaccaga gggtttgcat aagcaggagc tgtcaggcaa 45180 ttaaaacctc gttagttagt gtaattagta ttccaagacc atgatctggg attttggaca 45240 actcaaccaa gctggacagt ttatatcctg cttatgagaa gaaaaaaaaa aagtctgtgc 45300 aagggaagtt caagaaagag tctgggagca taaccacttt agcataaact cacagacagt 45360 caaaaaagtg atcatttcaa atgttcttat aattttatca ctataggtaa ggaatctcta 45420 ttagacactg tgttttttct actctcctgt cttgccagta ttatatattt actagtatct 45480 tgattaagag atcctcagag gggaaaaaaa aaaagctctt ttaaataaag agagttttaa 45540 ttaaaaaaga aaataatttg ctaatgggag gaagcagggc tgaatgcata ccatcatacg 45600 taaaaatata ccactaagca aagccctact ttcctccagt tcttgttgaa ggttaataga 45660 gctaacctta catgaaaatt gtaaagattg ccaaatttgt ctttttccca ctatgccctc 45720 cctattcttt tagttttttt ccacctcaga agcccagcat catctttctc tcgtttccac 45780 cagtaactat aatctaggaa aaattttaaa tttaaattta ccctttttga ttggttgtac 45840 cctagagtgt tacctctgcc agcatcccca cattttaatc ttaattgtga aattcatatt 45900 gaattcaatc cagaaagaag gagcaaggat ttctgcagtc aatgataccc cttcctgact 45960 gagccctgct cacttgggga ggattgtcct ggggctggcc cattgcatag gggcatttga 46020 gatgaatatg gaaaggacct cgccctaaga agtctagtgg gagagatcat agacacaagg 46080 agttataatg agggtagagc agccataccc agtctgaata gtgttatgcc cggtagacat 46140 tctagaagac attgataaac aagaatcaaa gctaacttta atcagtttaa aagctacttt 46200 gtgatttata gacacatatg tttacttcta gaattggcac ttctttgttc tttgtgaaat 46260 gtgtaactta agaaaggtat ttaggagcac taaaattaaa 
tatttgagtt tttttaaaga 46320 agcattctaa taataaaatt tgactcattc cctagttcat attgaccttc actaagcata 46380 ctgttctggg gcatttgaat aatttgtttc aaaagctatg agaccattct tttaatggtg 46440 catacgctct aagataagaa taatggattc ccattggttg atgatacatg caaaggtagg 46500 gatatactct tacagtgtgc cttcctgctt ttgattcgtg gtttctcttg ctaagtgttt 46560 tctgcatcag gaggaggtga cagcagtaag tggtgcctgt aaagttggaa aagggaagga 46620 attaacatga attaatgtct gtgtgagttt ttaaggaact ggaatgtccc tctaaatcga 46680 ggatgtgttt tttttgtttc tggttgacaa cttcctagtt ttcctacact tcattattct 46740 tttctctgtc actgggattt ttttttcaat ttcccgaaaa gcaaatgaag tgctttatcc 46800 ttatcagtag cctttctttt cctgttccaa taacaaaaga acaatacagg aatttaataa 46860 tgttaaaaaa ttgctgtgtc tttttcttaa tatatccatt tatttgtctg cagataaaat 46920 ccaagtttta cagacataga ggaaagctgt ttatataagt aaaatgagct actattgtat 46980 cttgtccttg tcagggtttt ctttttttgt cttttcaccc agtttagaaa gtggattttt 47040 tttttctctt taaaatgtgt gcccatgttc tgttattaag aggaaaagtc ctgaaaatac 47100 aaagataggt gaaaacactg gttatcctct aacagaagga cactttaaag attcctaaaa 47160 cacattctga tcttgtttat gtatctatat tctattaaat attgttattt actcttcctg 47220 ctgctttgga aaaaaaaaaa gtccaatctg cctcacagac cttttataaa tatcagagtt 47280 gtggttttgc aacagtgtac atgaaaagac acttgtttca gtgtttgagt tcatggcagg 47340 gtgcattaac gtggcgtctg caagtatcct tgaagccaca ttgaggaggg aacttgggac 47400 ttgttccatg gtggtcctgc tctgacctca gaggaccttg cttgctgact tttttaaggc 47460 atgaaacaaa ttctcatatt ctcagtcgtg tctgacgcag atctgggtac tagtcagggt 47520 tctaaactgc ctaaataaat atcttgcata taacatctaa agtgacattt atctctgtgg 47580 tactaaaaag aaaatactat tccacttaca tgttcttggt gccctttggc attttgattg 47640 ctaattacaa taggaccttc agattgcctt ctgattgaga cctaacttat ttctgtttca 47700 tctggaattt ttcaaataca attattatgt ggtgttaaca cttcattgca gcatggaact 47760 cccaaaaatg ttgtgtatat attctttcat gcctctcagc ctggcttcat gtctgtctag 47820 gtacaggtct tattataatg ttgcccttgg gggtcctttt gatatatcaa gaactaaacc 47880 tctttagtca ggagagtgta ttctgtatgg atttataata aatagtctgt gttcatatgc 47940 ataggaaata atgcattttc 
tgaaaaaaat tatgcatgtc acttgtattt tttgtacttt 48000 atgtgccgga agaacattgt atacaaatat gtcccatgtg catgtatgaa tttatatagc 48060 tacatgtata taagcacata tacatccatg atgatatcat aggtaatatg tggaaataag 48120 tgcaaccatg tgattatttt tatcttattc tttgtgtgtg tgtgtgtgtg tgtgtacagt 48180 tacctgtgtg ctattaaatg tgacagtgcc gataatagtg ttacattctc tggttaagtg 48240 tttcctgaga attttggtct tccttaatgg tgcacattag ttatagactc ttaacccgtg 48300 agatacattc cagtggttcg tttctaagaa tcgatgtaat attatgcact ttgtaaatag 48360 cttatagagg ataagctatg actcctgaaa taaatattat tcaaactgct tccattttca 48420 tttcccttat cactataaca accacaccca actttgcaat taaattgtgt tgtctatctg 48480 cttcccgcac actagtgaag ctgaccctat tgaattcaaa ggaaatatga gagataaagt 48540 cgaaaactgt cttccttttg ttgacatttc taaagcaaca agtcagaaag cttgactccc 48600 tttaggtaga ccagagctgc tacgggtcat gattagatgg tatgtccact agtaactgtg 48660 atgacgccca acagaattat ttgaaacatt ctcagtaaat aaccaagaac tccagtaagc 48720 aaaaggggag tagctgttaa cttgcagttt cagttttcta caaatggggg cgcagggggt 48780 ggggggcgca cggaataccg aggacaaagg taaagatcga catagaatgg aagtcaccag 48840 atttctgcgt ttaacccatt cttgcccagc ccaagtgaaa ttgtgttttg gagaagcctc 48900 catatttggc catcgagaga ttggcagtgg gaattcttgg ctgtatttta ctgctctctt 48960 tattgaaata tttgccacat ctgacagttc tgtcactgtt cagttcagag ggatttttaa 49020 atcatatact gactttgatg cttcatggat ttgtattatt tgtgtattcc tcagtcatac 49080 taacctttgt atcaagattt ttgccatatg actcatatca aagtttttaa gacatattta 49140 aaacaaatga attttaaata gtggcccatt tgctatatgt taaatttgat atacaaaatt 49200 ttatacatct taaatttata tatctttgat atcttaaatt tggtatacaa atggtggccc 49260 atttgatata tgtttgctta acataatgta acatgattga ttttgttata tttcccatga 49320 gctgaggtac tgttgttttc tattaccaga gaagacattt cagatctttt aaatcttatg 49380 tgaagtactt cctaagaaga gaagtaaaat gagtatatct cacattagtt actattatct 49440 gcagcatcaa aaataaatga atatccccag gaaaaaggtt ataaaattac aacacctaga 49500 cacattgagg taggtagtaa atgtgaaatg ggcttacgaa tgactttatt cacttactgg 49560 catttgggag ggactcaggc tgtcaggaaa atgaaccaat ctgtattaat ataaagcgct 49620 
agaaatataa ataaatcatg tttgaaagcg ataccaggat tttccttgaa tgctgctgaa 49680 taaattctgc ttgttaaatg tcagtgaaat gtatgtcacc tgcatgcata aagtgagagg 49740 gccacaaaaa acatccactt accactttga aatatgtaag ccctttaatg ttcctttgac 49800 cttcagttaa aggacatagt ctcatagcag aggaaatgcc tcaaattcta gcatctgggt 49860 ttgtttcttt atactttttt ggcatagtta caacagtagg gcttttctgt cctttaattt 49920 ttttcacagt acagtcaaca caggcttttc acacaagcgg gaaaggacgg ttgtgaggcc 49980 aaaagcaata tgtttcttct 50000 21 50000 DNA Homo sapiens 21 ctgtgctcca cggcagagcc taattgaatg gcttgttgtt tcagagagag cccagaacca 60 acaagcagac aaacagaatc ccacacaaat acaaattgcg tataaaatta gatacctaca 120 cacgtgcgta tcaatggata tgtactaaaa atgcagtcag cgatctctta ttttgagctc 180 attggcttag tcaccgtgtt ccaattcttt catctttgca gtttttggat tttcatctga 240 attttcagct ggctaaagat gtttcttcta cagctgaaaa gagaacctag tttcattaca 300 tacacttgta aagtgtatgt atttggatgg gggtgctcta tttcagccct attcatatgt 360 caaccatatt gtcatttatc attagcaaaa ggtcactttt aagctttgca gaatgaaacg 420 gtgagatccg attttccatt aattaaaaaa tatacatcat aacagtatgc tggagacatc 480 ttcatatatt tgtgttttcc ctgctttaca tccaataggt ttctgtgtat gagcttctcc 540 tgagactgaa taatagcagc acaccctaat gcaccttttt taatgggaat tcagttgggt 600 accatatgcc tttctcactg ctgaaggcct tcatgctttg gcatctctgc tctcatctta 660 ccggtgattg tgggccattt gtctggttct gacatggaaa tattaggaag cagtgctgtt 720 atatacaaat ccttatagcc ttaaaattaa atgcagctct caaaagcagc ctaaaagtta 780 aatgacacac aaaagcttgt tttgtctgaa gtttatatgt ttattctgta gagttacatt 840 ggatagggta gttatcagcc cttagcaaat gcctacttag ttttccttaa tctgttattc 900 attcattcat tcaacaaata tttattgagc gcctgctctg ttcagacact gtgctagatt 960 cagggcaaac aagacagata aggttctagt ccccccccga gctttcagtc ttaagtggca 1020 aaaataaaat tatgtgattg ctgttgacgt acagggtact gagactgaga taggatagct 1080 tgagaaagtc tctcctctga agagctgatg ttttaagttg gactgaaagg taagaaggag 1140 ctggctgtgc aaagagctcg tggaagaaca ttctagacag agtgaacttg cccatgttca 1200 gtgaccccag tatgggagaa agagttttgt gtcccgaggg acttaaagtt gtgatgtggc 1260 tgtagtgtat tgagtgagtg ttggggagag 
tggcagaaga ggaagctgga gaggcaggga 1320 agggctcagt tacacagaca acatggtaag gagtttggat tttattctaa gtataatagg 1380 aatccatgaa agagtttgaa gcttcagatg gagatgagag gatgcccttt ttaagatcaa 1440 ctgtggctgc catgtggaaa atggattgta gcgggaagct agagtagaat cagaaacatt 1500 atttaggcca ctgaagaaga gcccagccag gagactgcta tgtggctgga ccagggtgct 1560 agcagaggag ttggatagat tctctgttcc atttggaaat acagttaaaa gggctcaatt 1620 aggttggatt tgggggtgag gttgggggta gacttcagag tgaccccagg gttctggctt 1680 gagcagttgg taccatttcc taagatgggt aagactgggc aaaaactaag gggagtcaaa 1740 attacctgct tgagaaactg agagtatact ggaggcagaa gtgattctga gtccttttct 1800 tcgtgtggtg atgctatttg gaaatcaagt cattagagtg tttcactacc tggaacacat 1860 cagtaaaatt gtgaaaattc caccttggta gctcttggtt acagctcttc agctttgtct 1920 tgtgctgttt tatctaatcc tctgtctctt ctgtaggctg tttctctgac ctttggagaa 1980 ggcttttgtt tcttccccag ggatgtcaac acaacccata tcccaatttg tttggcttcc 2040 ctttaaaatt gttcatagat tctttctgta agtttttcac tacatgcatc tactatttca 2100 ggcatgatta tcgtcacact agaatcccgg tggaaccaag gctagctggg ggaaatgaaa 2160 gattgtgagt gtcatatctc atgcacactc cagagcatgt ggagaagccc ataccattta 2220 aatcattctt cctccatccc ctaactgagg acttatgtgg taggacctta ttatacttag 2280 tctatgtctg tgtgcttatc ataccacatt ttattttcct gtgtgcctgc caatatcctc 2340 tgaaagcagg gacggtgcct ggcatctagg ggatagttaa tgagtgtctg atgatttaaa 2400 aaaataatgg attggggctg ggcacagtgg ctcacgcctg taatcccagc actttggggg 2460 gccaaggcac gcggatcact taaggtcagg agttcaagac cagccagggc aacatggtga 2520 aaccctgtct ctactaaaaa tataaaaatt agcttggcat ggtagggtgt gcctgtaatc 2580 ccagctactc aggaggctga ggcaggagaa tcacttgaac ctgggaggtg gaggttacag 2640 tgagcagaga tcgtgccact gcactccagt ctgggtgata gggtgggatt ttgtctcaaa 2700 ataataataa taataataat aatggattgg aactatggaa ctaaattata aaagataact 2760 ttcatgatat gctaaaaaat gggactttat tgccattgag gaacctaaat tataaaagat 2820 aactttcatg atatgctaaa agaatgggaa ctttattgcc attgcaagct tgtgaggaga 2880 gaggtagcat gtgaagttaa gactagaatg accggttcct tgattgaaaa caatcaataa 2940 aggtgctgtg aacagattat cagaaagtat ctgtaagtat 
aaaacacatt cagaatttag 3000 aaaacaagat gtaactccta ctacaacact gcgtttgtct tctgaaagga agagacatca 3060 caggtttagt attttttgac ctgtcatttg attttagttt acgcctaatt tcatttaagc 3120 ttatcctttt ggccatagtc ccatttcttt tcagtgtttt tgctgccagt ttttcagata 3180 cagaaggaaa gtcatatttg ctgtgtgcag ttggaagaaa ccctggcaca gttcctccag 3240 aaaccaaaga cagaaattat tagccatatc cagaccagta gctgactggc tccaattctt 3300 cctcccatga gctttcctga ttaatgggag aatctgttct gtagtcgaac agcctctttc 3360 ctgagcccag tctgtcctgg cttcagagca tgaccagctg tgcagctgaa gccattgcca 3420 gggctctgtt tctgattctg ctcctcagca tgttggaggt gttggacggg gaagcagagg 3480 ggtgggaaga gcatccaggt cagtttagat gtttcagcag gtaacagacg tggcaccgtc 3540 cacccagaat cccgccactc ccattggact ctgtcctctt cagaaggctg tcattctgca 3600 tggtgcagat gttgcagtcc ttcccttatt cttttgtctt tttgtccaga ccacttagct 3660 cctgtgtctg tggtgattaa atgctttgta actttttctg acatcctttt cttcacccct 3720 cttctctctt cccctactgt ctcgcccacc atacctcttc tgttttctgt ttgcctttct 3780 ttcctctcag ctattgcatt ttccccttta cactgaaata gctttgttta ttggcctgtt 3840 tccagcaata ttaacatatt tcaatttttt actcacctgt ttaccaattt aacaaagcta 3900 aggtggttca tggcagtatt tggggcctga aatatgaaat actgtcatag cagattgtat 3960 acgttcaatt tctgttacct ttctcttacc tttcttcctt attctagact caatctattt 4020 catttcaggg gtctgttatc tgagtgtttg gagtttggga tccttgtagg atcttcaaag 4080 ctaaagacct atctccactg catctctgag atcttaaaaa caataatatg aagatccaga 4140 ataaattggt gtcagttaaa ctctaaccaa agaaggttta aatgcagtga gcatttgagt 4200 gataaataag gtacggcccg aactcaagag gcttccagtt ttcttgggga gacaggcatg 4260 aacaccactc actagaatag aaagtagaat gaaataagta ctttcataat ttacaggtac 4320 gagaaaaatg cacaggatta ctgagaaagg agttatattc tgactaggaa aatcagaaat 4380 tgtggcattt tctgtgaaca ttgaaggatg aacaggattt cagtaaatgg agaagaaagg 4440 caggagggct tcctagacca aggaggtgat ggtggtggtg ggtggaggag aattgtgcat 4500 gccgatcttc tgtgcagcac tgggtggcag gcacattgca cctaaatgag cactgactat 4560 gctttcaaga agttgggact aagggaacag gtgcaggtta gaattgtagc ctgaaggtaa 4620 agtaagagct caaggaggag tgtttacttc ctgaatttta gaaatgtagc 
atgtttataa 4680 atagagccag cagagaggga gaaattcaag gcacagtgca ggagtgcaag gccccaggta 4740 acacgacaga ctcttctttg agtcctgcct ctctcacttt tcatgtttga gctcttttta 4800 ttgtttgctt tttgagatgg tctcgctctg acacccaagg agaagtgcag tggtgcaatt 4860 atggttcact gcagcctcca cctccccagg ctcagcttcc caagtagctg ggaatatatg 4920 tgcacaccac cctgcctggc gaatttttgt attttttgta aagatggcat ttcactgtgt 4980 tgcctgggct ggtcttgaac tcctgggctc aggcagtcca cccacctcgg cctcccaaag 5040 tgctggcatt acaggggtga gccaccgtgc ccagccactt gagctcttta ttctgccatg 5100 tccgggttcc ccttttctat gcaatgcaga ggtggtataa agccactaaa ccaaggaatc 5160 agcttcagaa acaaggtaaa gctgccttca ctcaatggga aatcacatct atagatcacc 5220 gctctgttgg cagggtcaga tgactgaggt gtcttgtgtc ttgatcaaaa gttttgtttc 5280 cactgggtct ctgtcctgag gtctccatgg atctgtgacc ttggacaagt catttaaccc 5340 ctaggcacct tccttgccct ctctgttaaa atgagatgat ggtcccagcg ctgtcttagg 5400 cctctgtgtc accaccactg gagttgttta aaatccaggc tgtataggtt ctgtctcctc 5460 ttctcctccc actgccttcc ccacttctcc ttcctctctc ctgctgtagt cattagcctg 5520 atgtatcctg acagtatccc tgtaataaat ataagtggaa tttgctaatt aaatgctagc 5580 atttttaaag aaagtttcca gcaattggta gagtagctga gtggtttata ggcttcctat 5640 tcaagaggga ggtgtcctaa ggtttaaaat taaggttcta gtcataaagg aggcttttat 5700 tattaggaca acatggtaat ttggttgaga gaggacttac tctctgcaac attaaaacat 5760 acatatactt ttgacagttt cagtagtagc tttgtccttt tgatttactt ctttggacag 5820 ttgtgattga catcagattg aacacatggt tcagtaaaaa ctacagctcc ttgaagtact 5880 tttgttaata ttctaatctc ctatttgcat cattaacaag agcacatttt attgccgctt 5940 gaaatagact cattgacatc ccagaatagg tagttttgat attagctact tgttaatgct 6000 gcccaggcca ttcacattcc atgttcttga taatgtacca cgttcttagt acacagccct 6060 tacttaccac attgaagaga ctgaatgcac tgattttaag aaggaggtct tgtgcctcct 6120 agggggacta gctcttttag tattatttct cttatttctg aagcttatgt tagaatctga 6180 gcataaagct tcatttcatt ttccctggag ccaattatga tctcaaaatg agaagtgcat 6240 gagtaaataa atttaaaggg ggcttttggg aaatgttctg acataggcga tctgcgcagc 6300 acatctgaaa tacctgggat gtggttatat acagtggtca tcatattcag tggtgacatt 
6360 cagaaatgat ggcagtttat aaataatcct aatttgtagc aaatatgtgg ctttaaggcc 6420 ctgtgatgtg attgttttct ttgttctgtt gatcttgtga ctttaaagtg gcattcagtt 6480 ctctgttcat tttaaatgtt tgtcgcttct gacttctgtt tcctcttatt gttttctgtt 6540 tttgtgtttg agcccttaac tcaggtgtat ttatttcccc ctcccaactt cctgccctct 6600 ttcttcttct gatatcttct ctttgataac catgtcttcc atgtttagtg ttctcttgcc 6660 ctccagcaac ctatgttagt ttggttcatt tgtttaaaaa caaatctcct atttttgaga 6720 atgagaatgt ggtgttatta actaacattt agagcatgct tatatatcag atgtccccag 6780 gcattttcta tggattatgt tatttaatat acccaagaat cccatgaagt aagttaaagg 6840 ggtgaagtaa cttgcccaag atttcacagc agggagaggt agagccgagg ttggaaccca 6900 gatggtttca ttctggatct cctattcata accaatcctg ttactttatt gtctcaactg 6960 ttcagtccgt caagatttaa tgatcaccag cgttgtccca ggcacagtgc agacacaagg 7020 acactagatc cctttctctt gccatatgta cggggtctta agtgccattt gagaggttct 7080 tccatagcac ctgtgcatgc cagtgcctaa ctcttttcaa aagttcagga ggtcttcatc 7140 cctgtggagc tgaggtttgc aggataatta agagtttgcc agctggggaa gcagagaagg 7200 catagggaca tggggtaaga aaggagagca cagtcgacac tgcaacagct ctcatgtgac 7260 tggagcatag gtgctgccat agcagaacag atgggcagtt ggggaatacg tcgtggggtt 7320 ggttgagtcc aggtggccaa gggctttgag tgtcatggca cagaatttgg acttcatttt 7380 tcagccattt aggaagctag tacaaccttc ttgagtctct cctgcctgtt ttgcctcatt 7440 ccacagaggc cactggagat ctggaaaccc cgtggtgttt aacattggca gctgagacct 7500 ctacgtgatc agcttagagg aaattttatt gcagtagcat tgtaaaaact agtgttttca 7560 tacagtcttt tgatccttgg tcccgcataa ctactgttaa atccaaaacc ataagcagaa 7620 atttaacatg agtgcttata ttcttaactt gtaagatgaa tacagagata ctttccaaag 7680 aagtttttaa tgtaaattta aatgtatgac ttttatgcag gtatggtatg tgctttttcc 7740 tgattaaatt tcctttttgg gggagaaatt gagaatttaa atgatgtcat gaaacaaaag 7800 gctttggata ctgttgaaaa gctgcaatca gtactgttta aaatgcttac tcactattac 7860 aattaatctt tataaagcaa acatctttct tgatgggact tttttcaaaa accattttta 7920 ttagtctgtt ttctggtggt agtaagatga ctagatatga gcccagttta aactcaaatt 7980 cttacttttc cactttcagt aagaaaaata attagcatta acatttcagt ggaattgcaa 8040 
atatattgcc ctagcttact cagggaaagg gaattaccag ctggcgaaaa agcagtagaa 8100 tgaagtgtag tcaggcgtta tctcacattg agattcttag gtggggactc tttgaaacaa 8160 atctcaaaaa aattttaaaa tccagaagat ggcctctttt cacagccatt accttggagg 8220 cccattttct tgtcctaacg tagctgccct tgcatgaagg aggaatgttt tttgtgtctc 8280 ttgctctttc cctcctttct ttccttcttt ccttttttta tttttaattg ccttcagtgt 8340 ttgcagcagt tttttcaatg cctttggtaa tggcaaactt tgattttttg aagttggatt 8400 taattttggg aaacagccaa aaggcatatg gagtcatacc tgattaataa aataggtaag 8460 cactttggag aattctattt ttgctaaaac ataagcttaa actattaggc actgaggcag 8520 attttattgt atatctcatg aactgccttt aaagaaaatg tcaaaaggaa cattacagaa 8580 aaatgttgag aagtggcaga attgggatac tagatagctt ccttaggtga ctttgaagaa 8640 caattttatt gatgtgaatt tataagtcct attattttgt ttcttggggg atttgtctct 8700 attttgcggt tttatcttat atattcaata tagttaatgt tctttaatta aaaaaaatct 8760 ttaggatact ttagtagcac tcagtcttta atttgaatgt ctgttagtgt agtactatat 8820 atgtggggat atacctgtac cagttaatag agtgtgtagt taataaaagg tggcagggtg 8880 gtaaaccttt ctccccctgc ataattgaat agttttaagt gccatttgaa atcctgaata 8940 tccaagattg gattccaata actgtgtagc ctgagaagct ttgctgtcag cagctgctgg 9000 tgtttgtaag agttatttca ctacttatcg ctcttcagat tcaccatgaa tttaggtagt 9060 aatttaccat tgcagaaggg gcaagctctg cttctgatag tgttatgata ctatgttttt 9120 gcttccttaa tgatacgatt cttgaggtag aaaaattaaa aatatgtggt ataaatcagc 9180 ttcatgaatt aaattcccca gatttaatga tgacagtttt attcctggga tgcttacgct 9240 ataactccct aaatgcttca tttaatactt cggaaatggt tgttctcata ggagctgtta 9300 ttgcaaagtc agtaatggct ctggggctga aatttgcttt cttaattcta gtccattgtc 9360 ttttgttgtc atgtttcctc agtttcctta gatatgtcac accaaggact ggttacagaa 9420 atactgcagt gcattatcca atctaagcct gctttacttt ctttcaaatg agcttttaag 9480 catataagta gcgtatataa tggaaacaga atttagggtg cttaggtaag aaaggaatta 9540 cttccatact agagtatttt acagctttgt ttgatctctt aattgtctag atcagagatt 9600 cttgggtact ttcaaattca ttttaatgta ccatagccct ttaaaaatga caaataaatt 9660 taggcagttt gttttcagaa ctgatttgaa gaatttcagc atactattta aggtaagtag 9720 tcctatactg 
gggatcctaa gctagagtca gcttggactc gttgaaccac gtgaccaccc 9780 agggtttcct atagagcacg tttttagcta tttaaaataa ttaccatgct ggacctggag 9840 acctgtgaat agtccataaa gttaattaca ccttgagaca caccctctgc tcagctactc 9900 aagtaccgta tccactttac aatgttcctt tgttaacaag atctatgtgg ttgaagtatg 9960 tattatagta atgtgcaatt cattgtgtac agacatttcc aaagtaagat gatggtaata 10020 attacacagt aaaagagttc aggccttttg cgattgttgt gtagctagga tcggaaggct 10080 ttgtaaactt gatatgtttt tgcatacttt gggttggtgc tacctgatta tggtactaaa 10140 gtatttgagt cctctgccct tgtatcattt gtttgttaga ctttacattg ttgactgggt 10200 tccatgattt ataaaaagat cccagaatct tttgtaacta tttaaaggtc agcagctaat 10260 gataatcaga actaataata gatgcttact tatgcaccag gtattagtct aagccattga 10320 tgtgttagtt aatacctctc aacaagtaag tgaggtagag acttttatta ttttatagat 10380 gaggaagctg aagctaggaa aggtgactga ggcccacaga cttgtagtaa gtggtagaac 10440 aaactaggca gacaggtacg taggtaggtg gatagaaaga aatagagaaa aaaagagatt 10500 taagaatagt agaaggtgtc agtaaaggat ataatagctg atttcctgcc tgctgtggtc 10560 taggcattat gttgggctac aaggaaacaa aaatgtatga aactcaggcc cagtcctcat 10620 ggaaatcaca tttggtcatg gattaccctg atataagtgt aaataattaa tgatgcaatg 10680 tgaacatttt ataaagtagg ttgtgcataa agtctacgaa aatagactga gagaaggaat 10740 ccaacttgta ggggaggcag agatgaagag atattcacag aggagaaggc ttttgcttaa 10800 agatgaccac agtgcccttt gagagtggga accatgtatt gttcaaccgt gtctctccag 10860 atctctagca aagtgcccag gatgtcatgg ttgctcagta aatgttgggt ggtaggtggt 10920 agcagtctgg ccagatgggt gaatgttggg gaagagcatt gtaggagatg gacatatctg 10980 atgaaaagca cagaggcaga taaggccatg gtatgctcat ccacagtagg gctgtgtcct 11040 tgggaagtag gacatatggg agcatggcca ctgtgctact tcacacatca ctctcctttt 11100 acttttagtc tctactccca acatcctcac tttacctgct ttaaatcgta tttgtatttc 11160 aaaaccctga taaaaatttc tcacttcctc cgtgaagttt tttcagcagt tgccatgcac 11220 cctagtctct cctttctcta gtcacaatcc tttttgttcc tgccactcat gagcctcaat 11280 ggatatctca cttacactga tttgatcttt acaaattgta tgcatgtatt aaattatcat 11340 atgtacccac ccccaaaata taacagattg cctgcagtca cctctttgat tattgttttc 
11400 ttttgttaaa ttatttcctc tattctctct tcttggagtg cagagaacct taggattcat 11460 atgctaccct tcctattggt acaagaggac acggaagccc attgagatga aaataattgt 11520 ccaaatcacg taagcagtta aagtcaggcc cgggactgag tccatggctg agaccaatcc 11580 atgctagcta atgttcctct tgttcttccg agctcagtag tgagtttgtg gaggtgaggg 11640 actgggtatc atactcactg tgggtcaagt agctgctcag gaacatttgc aaatgaaaga 11700 atgaatacca cttggctttc tgagtaatat aaggataatg tggttcccag ccttaataaa 11760 cagctctagc agactaaaca tattgatagt aagccctcaa ctgttagata tactcattca 11820 gaaataatat cagaagatca actgctgagc agaattttca tgttaaaact atttcactag 11880 gagtactggt acaggtcggt gttagactca ttactgaaaa ctcacttgaa gaactctatt 11940 agctttacat atttctagca gattaaactt agattactta cttttaaatt attattggaa 12000 gttagtctaa gttattttcc aatatagaat cattgtctta atttaataat tcttgcttca 12060 caataacata atttggaacg gaaattagat tccagagtgc ccttgtaatt acaaattgaa 12120 aaatcctttt ctgactcaaa agggggctta ctggtgacaa attccattat cagcaacatt 12180 ctcatcagag gctcagatga ggtggtgcct tctgatgcat tcttaagtgg gtaatgggta 12240 gaagagttag gtgtgaggtt attgctaaat acgaataacc atgaattata caagatttat 12300 tactcagaaa ccaacagagt agagccttgc aattcattgc tttaacattc atgaattcag 12360 ttatttatga aacagtgacc cagaggatcc atgtcatgta gtggtttgtc atttgaatca 12420 ggtgaagccc ctgtgcaggg gaaaatcact tagctggtga caagccaacc acccaattgt 12480 ttttagccca ttatcacata tgtaggaagt catgaactag tcaaaactgt tttctttgtg 12540 aaagaaggcc tggtgatggg agtgcagagg aataacatgt tcctagaaag tgatgattct 12600 ttaccaaaaa acagaaatct gggaccaatt aaaaagccgg catgctgtat ttggcaatag 12660 attgaatctc tgaaatgcat agtccagaac caagaagagc aatttctgga aatacctcag 12720 tgagtgtact ggccagtgta aaatttcact ttctgggaca attatgatcc atccgttttt 12780 tataggaggc ccagtaatga tatcttgact gaattgcctt tttgagttac aatcattaat 12840 cttgaaggtc atttgagatg tctcaagtct cttatttaag tctttgagat caagtaaaat 12900 ccctagcatg tatcaatagc cacctgtatc aaacattgtg tatgtttttt aaataaagaa 12960 tatctggcta taggattccc ttagcctctt tggaagaaat tatcttgaat agggaagata 13020 gtgaagaact ggtatgtcca agtagaacta ggaattagag 
caatagtctg gatcctggag 13080 ctcagaaaat gggcagtaag tcagggacaa aaggtggctt cttaatctgt tgttaaaaat 13140 tacttagagt agaaattcac attgagccag tccatcaagc tcccagtatc actaaagaaa 13200 acctgtggta ctaccactcc cccttgttta tatctgacct cccacaaggc tccttttttg 13260 tatgtgtagt gctttcccta ttccagacac atttttatgt tttttaatta taacatggac 13320 aaaggatagc attgacataa accccaaatt ttaataaatg actttcctga aaatgagagt 13380 agcagtgact ttaaaaatgg taagaaaagg agccgtgcgc agtggctcac gcctgtaatc 13440 ccagcacttt gggaggctga ggcgggtgga tcacgaggtc aggaattcaa gaccagcctg 13500 actaacatgg tgaaacccca tctctgctaa aaatacaaaa attagccggg tgtagtggtg 13560 tgcgcctata atcccaacta ctcaggaggc taaggaagga gaattgcttg aacctgggag 13620 gcagatgttg cagtgagccg agatcgcacc actgcactcc agcctgggtg acagagtgaa 13680 actctgtctc aaaaaaaaaa aagaaagaaa agaaaaagaa gaggagaaaa tggggaagaa 13740 atagagaagg ctggcttcat cctcctcatt tgctaggtag gagatctgca gtgccttcac 13800 aagactaggg aggcagatgc atccactcaa tatagggata gagtctttca gccatcagca 13860 ggagatgttc aggtaattaa cttcccctca taaaaccaag gatcagaaca agagcagaga 13920 agcactgttt gaactgtgag gaagccatgg cagcatgaga gggtcttgta gacagaaagc 13980 tttcagaggt gggagccgta acaagccaag gcctgaagca gacacacaac ttccacagga 14040 agcattctac catctttcat gttgagacac cctgtctagt cttaccagtg gctctggtgg 14100 aagagcttct actttcctct ggctggactt catcttctga atgggttaag gagactgcca 14160 gaggtctaaa cttctgggga ttcagttgtt acagagaagt ccctatggta tggaggccct 14220 aggttgttcc tcatgggcgc tctctctgtc tagggctcca gccccaagac ccttggtgta 14280 cagacagatt gcccttgtat tccatcttgc aggcctgccc tggctcccat tcctagccca 14340 tttgttcact gtgtctcagg gggcgaggct ccgcctgtgt ggctgggtag cctatcagca 14400 cctggctggt ggacagcact atagctgtcc tatgactaaa ggttgctctt ctgacctttc 14460 ttagccactc cactaataaa gggttatttt tgtgttggtt tggtttgtat gtttagctga 14520 tttcatgata aaggaacttt gtaatcaaac actttttcag ctagattcta agcaatttta 14580 aatattatgt aagagatcaa ataactatca gtgtaagatg cagactacag actgaaaaat 14640 gagaaggtta ggatgatgac tgtgtgtttt gatggaagca gtgcaggact tcccaggaga 14700 ctgggcttct ggccctgcct 
tgacattcat aggtaaatga cccttgggca agtcatttgg 14760 cttttctaag tcactgtttt ctttactcat atctacgaat ttctgattaa gacaactttt 14820 cttctccttt gacaagtaag tcttgtagac agaatcattc ccacatttgt atgtaatatt 14880 gattccccta ttctcttgta gttaaagctt caaatctcta attcatcttt tttgtaggaa 14940 ggattagatt gatttattga aaattctgag ctatagaata ctgtagaaga agcctgggct 15000 ttctgcttaa agttcatata tacagaaaat ctaggcttgt ggatgacaaa gcgttttctg 15060 tacctcattg cttttattaa aatacccaac ttattatagt gtctcttatt tagcagttgg 15120 ttaaatattc ctggagttaa tgagtaagaa tcttctatct tgtaacaaat agatgtagag 15180 ctttggtaag aagtacacat attacttgtt tgagtgtttc tcaagtgatt cagaatgctt 15240 tacttacatt cgctcttaag gcaaaatact tccattattc ttacagttca ttgtcttagg 15300 aaacttgttg cagaagtttc taattgtccc catatacatg tgttcattct cctcttcctg 15360 cgtagatata aaatgtttaa agatacagtc acccagaata aagacaccgt tgagggaggt 15420 gtggccacgt gactgagtcc tggtccatga agtgagcata aatgacagtg ctccttttcc 15480 cataatgctg tttagagttg cgccttcccc ttctactttt cccctctccc acagttggaa 15540 taccatatgg acaagggctg caccgtaggg attgaatggg caaaaaagct gaaaggaact 15600 taggtcccta gcacctgtgt ggagcccaat ccccataaca tttatagaat taaatttcat 15660 gacactctta ttgcacataa cacaaattcc caactactgg catgtggctt tgtgagagct 15720 gtttcaccat aatctgttga gacttatttg tggaatcatg atgtcccgtg acatttgaaa 15780 atggaagaga atcctggaat atgggagcaa cactgacgca gaacacagtg ttcactttgg 15840 aagcagtcat gatctcttct tgctgtggaa gcctcaacag tttgcctgtt ccctccattg 15900 tactcctctc tgaattctct ctcaggccta gatgctgggc tctgtgtggg ttgctgtatg 15960 gtatagaaga ggatgagatt ggcggggacg gctcttgttc aggtccatca agactgtcgc 16020 tgtcctctat accttgagcc agctctactg tgcatttata gaaaagagag accagcacac 16080 agaaagagag aaagcagctg tttttagggg gaggagagaa gatggtggag cttacaagag 16140 actgatgttt gcaagtggat atggtataaa gtagtagttc tcaaagcacg gtgcctagac 16200 agcagcatca ggagcctgtg aatttgttag aagcacacat cctgaggctt cacccaagac 16260 caggggataa gatttggcaa tctgtgctta aaaaagcgtt tgccttgctc acttctgaga 16320 accagtggtc taaagagtgt tgggaagcct gacagtggta gttacaaggg tgtaccctga 16380 
tgggagtcct gggatcaaaa agacttgaac tagaagtaag ggaccaggtg agggagcatt 16440 cccttcagag agagttgtta aaataagccg gggttctcat gatgctgctg aaaggagtgt 16500 aagtactgta cctcctgtgt ttgggaccaa gggtgtatct agccttaggc ttccattcca 16560 gcacatcccg tcagccagat gtgcaaaagt tcctccgtat acacactgta ctgcttaaat 16620 tgctggtatt ttgaaataga aaaaggtgct tataaaccca tactgactgt gatgaacagg 16680 gagtactttt tccacctacc tctgcttggg aggctgaacc ccatcttgcc ctggaagatg 16740 gctttgagcc agaagccaag ccctttaatg gctatgaagg ggggaaaaag tatgatttcc 16800 atgtaagggc taccaagaat ataagtacct atttaataaa ttatttgttg aaatgccttt 16860 tctattttgg tagttgaagt ttgaaaaatg tttggagtac tataattatc atttctggga 16920 aaaaaataca cattgcagta tttttgtttt tgctacaggt caagttgtag ttcaggatgg 16980 aaagtaaaca catccctttg ctcagctgca gctcacatat ttgtagtctc tgggtgccct 17040 cttggcagtc gtggtggata cccaaaggac acaaacaggg catagaaacg cggggctcca 17100 tcaggggctt gccccagtgc aagtctgatt aggaagtttg ctggagccga aggaatggcc 17160 cagaagggtg gagatgcctt cacagaagga gtcctaggac cagaatgagg ctgttctcag 17220 aggcatgccc acctctggga ggggtctgca gaagcacttt ctcaccaggg agagccttta 17280 tggtatgggt ctaaacactg ggagatcttt gtttgttttt ggaggcactc cttaccattc 17340 tatgccatct tgtcaatttg acaactttgt aaacagggcc catccacaca tcttgaatgt 17400 agaaatagct gagaacagtc tccaggagaa attacctata taaggctgct tccttccttc 17460 tctccagagt ggagacgatt tataaagaga aagtattgga agctgacaga gaaacagcac 17520 ttgagagtaa aacttttggc atgtaagagg actgtgtgtg tgtgtaccca tgcctatgag 17580 tgtgtgtctg tgtttcagaa tcaactttac ctttctgaga catagcaagc atggagtgat 17640 tacatctgtg ccttgggaca tcgcgttagg tgttaggttt aaaacaaact gaggagaaaa 17700 cataagaacc tgtgagtact ctgtgtgtgt gtatatgtat atacacatat atgtatgaaa 17760 tgtttttata cttacacact ttatatttta cataatatga aaaagtatat attcttttac 17820 tctccaaggc agtcttgtag aataaatgtc tttatgccct gtacactcta ttcagcaccc 17880 tctttttgtt cttgctatat cattttaatt acataagtaa agtcactttc agaaatctca 17940 gatcaacaca cgatattctt tgaagtcttt ctaaaaacat ctgaaatcca tatgccacct 18000 ctgggactcg ggaatttttt gacaatatta cgataatacc cttggtttat 
tgttccttgt 18060 ctttaaaatc acgtaagatc acactttcta ctacactact aaaaactcta tagagtgttg 18120 acaaatactg gaggtttaaa ggaacatgac tggctgctcc tttaattcag caagcgtttg 18180 tcggtcttcc ttgaatgagc aggaactgag catccttgga acctgactgc ataggaaggg 18240 ctctcagaac atctggttgg aatatgttca taggagacac ccacaagggc cagcccctca 18300 ttttacagat gaacaccaga cccttcactc tgacatattt aaaatctgac attttattat 18360 gtactttttt caaattgctg tcattgttgc acaaaggaag aaccaagtta ccgtgtccct 18420 tcctcagtgg aggaaaagtt gatagaggtt ttatttgttg ttctaaaatg gatgccacct 18480 tccggtaaac attacttggg gagttcttaa tttctaatgt ttatgcctct gtgacatttt 18540 caagctgtta taaagagtct tcttaaaatg ttgctgagct ggctatacaa atttagttca 18600 tttaatttac ctcattagtc atttgtccca ttctctaata attttgattc cttttttgtg 18660 ggtcagtgtt tatggcaaaa tgtgttcggt gggggaggta ttcagagaag actgtagttt 18720 attaggtaaa attaaacgtt ttgaaaccag tctttagttc tttgtacatt tatctaaacc 18780 atccttaggc cttattttat tgccagattt ttaaaaaaat aaattcattg ttacattttg 18840 aaatttgtta tggtgtgtga atatataaat ataattattt gaatggcaat tttttctggg 18900 agtaggttgg atatattttt ctggtgtgaa ttaacagatg aaataaaatg tctctgaact 18960 cagtaaatat ttgaaaactg ttttaaaatt cagtaggatg caaacaggcc acatttggca 19020 tccggctctc ccactcccca gtccttccac cccactacct cccataaaag gcaaatcctg 19080 gaagccactc tgaacttttt tccttcctcc caggaagctc tgacttttcc aggaagattc 19140 cagggaacac tagttggatc cgatttccta agaattaaag cagctttaaa tctcctggga 19200 gtgtagagga tgaaggggtt gagggtccct agaacaatag gtcagcattg actgcagatt 19260 ggagtcagct ggggaactgt aaaaaaaaat atataccgat accagggtac cacttccagt 19320 aattctgttt aattggaatt ggtctggggt gcactctgag cttttggatt tttaatcttc 19380 tcctagggtt tttaatgggc agccgaggtt gacagtcatt attctggcac taaattttca 19440 aactatagta tgcatgagaa tcacctctag ggtttgttca aacccagatt accgagcccc 19500 actcccagca tttctgattc tgtaggtctg gtgggcctta gaatttgcac ttctagcgag 19560 ctccttagga gatgccgatg ctgctggtgc aaggatcatt gtttgagaac catcacttta 19620 gagcagttca tctcaaactt tactgtgcat cttgctaaaa tgcagattct gattcagtag 19680 gactggagtg aggtctaagt ttctgcattt 
ccagcaagct cccaagctcc caggtgttgc 19740 tgaggctact ggtctagggg ccacacttta agtaacaaat ctctaatcta aagcatcatt 19800 atgaatgcaa ttccataaca tcgtttggga agaaaagatg tgtggttcag gagattccta 19860 gaggcacggt ggggccgata tatgccttag acctggtggc aggcagagat ctggagggct 19920 cagtgcttct ttggcacttg gccagctgtg gcttgccagg gtgttggaac atgcccttgg 19980 cgaagctgta ctgtcaatag ggagcttcat tgtaaagatt ctctgtttgc ttggagtaag 20040 gcagtggttc tcaggcccaa ttgcacaata gaatcaccct gggagctatt taaacaagta 20100 ctaatgccca ggctcctccc ccagaatttc tgatttaatt ggtttgaggt gggatccaga 20160 cagtggtatt tttaaaatct ttgtgagtga ttttaatgca cagccagggt tgagaaccac 20220 tgaattaaga aatcacgtta cagtcttaca gatcactttt ttccctttat atttggctaa 20280 gttatactgc acagtgatgc aaactagctt tattaggtag tcaaaaacaa tttaatatgg 20340 catttttgac aaaccaccaa gaaaagactc tgcaaagatt ccgtcgctgt taaatcctaa 20400 attcggagtc ttaagatagt gctgggaaag tgcagtgggt gtttctgaac gagcttgaaa 20460 tatgaaatat ttctgggact gtatcaacaa gtctaatttt ttccccttgg ggttatgatt 20520 gtgcggaaca agtcttcaca gcaaagcttg ccaaaaaagg ttatgataaa atcatggaat 20580 gtgtatgaaa agaattcaca gtgtaacttt aaaggagtca gtctctaaat ttaaaaactt 20640 agggatgaca tacgtattac atttgtaata aaagttttaa gaacaaagta atgctgaata 20700 taagaaagtt cctcctttta tcacatgata tctctaaggc caaggaacaa gtgaaataga 20760 acagccagca agaaagtaat gaggatacgc taaagacact gaacctcttt atctttgcaa 20820 attaattttg tcctctgtat ttaagcaaaa atgtagctat tgactaaata tacagatacc 20880 agcaacgccg agtaggaaag ttgaagctct tcttgctttt ggcattatgg acgactagtg 20940 ggaaaatcat gggctttggc agttctgcca cttgctttgt gaccttggcc aagtttctta 21000 tcctctctgg gtctctctct tcatctgtca agtagacatg aagatataat ctcttcttca 21060 cacaggagta caaaggcgtg tgagaatgag tagaggcgtg ccccacacgt aggagatgct 21120 caattagtag agatttcttt catttctcca gcatgaccat gagaatttaa tgcttaataa 21180 gaagcattat ccatcaaaac atttttcttg ttgagtcttt attggtattt tctgcctatt 21240 aattgtatca gtttagattt cttattaata atcatgaagt ctgtttttcc acttgaagag 21300 taggcccttt cttcttgctt tagagccatc atttttgcca ttccctttgt ctggaacact 21360 cttgctttgt 
gttttgtgtg ctttctccct tctttcatgc aagcctgtgt tgcgatgcca 21420 tgtcagagat accttccttc actcgcctgg tgacacatgg cacccgtatc actgaatatg 21480 ctgcaaagcc tttatcataa cctggcatgg tgctgtgtag ctttttattg tctgtcttct 21540 ccagaattgt aattctcaag gttggtctcc ctgtgctact gagtgaccaa gggcccccgt 21600 gatactctag gctgatacca cggctactct agggctttag attataaaag acatcagagg 21660 gaggtgtcgg aaaaaacagt gaccccagac tcttctcttg actttgtgtt ctacctacta 21720 aatttccgct atcaaatata atccaaattt ggggtccttt gaattccccg aagtagcttg 21780 aaccgcccaa ggattgtctc aggatggctg tgtatctcac tgtgtgtgtg tgtgtgtgtg 21840 agtgagagag agacagagag agagaatgaa gaacgagaag agatttcaaa atagaagttt 21900 tcagtgtcag atgtatggga ctctacaaac tttaaataga aataatggta atctgccaat 21960 gtaattttta tcttcttttt ttttcctctt actaccagta gtttttcatt tgtctttgtg 22020 ttttatttaa atttgctttt aagcagaagc tcttgcatgc atttgcttct cgtagatcat 22080 gattgtatta ggttggtgca aaagtaatcg cagtttttgt cattactttc aatggcaaaa 22140 acagcgattg cttttgcacc aacctatact tccttgacat atggacctga agagggccta 22200 aaaatggcca agatccagtc atacacatta aagtgaccag gattgataga aactttccag 22260 ggttatggta aaatcatttt atattgggtg agcctgtcaa tattttaata ctaacttctt 22320 aaagattatt aagatgatga aaccaggtta tttttccata gttacttctg ttgcaagtga 22380 tattgtcact ttggtctttc tatatctggg caggcttttt ttttttttaa tgtctgttgc 22440 actgttatca gtgtaatcag acatagaagt tattttaagt attatatata gatctgttag 22500 acaccatagg gaaagagtta ggagagactt caaatagata aaattaaaaa ttcagaagta 22560 atgtcagaaa aactttctaa ttgatggagt ggctgtagtt ttatccttat tttctccatg 22620 gaaaaaacat agaatagaga cagagaagaa taaataaaat ctgatgggag aatttataga 22680 aaaggctgaa aaaaaaggtt tcagaaaaac aattcctgat gttatggtaa ttgtactaca 22740 tttaagtgac aagctgatga ctagttcggg gcagaggaag atgcagggtg agtagggatt 22800 ctagataatt tattgaaagg tgaatagagg gttgaaagaa gagaaaaagg aatgttaatt 22860 acaaaataag aaagatggag acttttgaaa tctaaattat ttgaaagcat aacttcacaa 22920 tttgaaggct tgaatcttat gcctgctagt catgaagtta aaaacatagc tttgaaaaat 22980 taaagtaagt gctggaaatt tgaagagaag ttactggctc gtagtagatt ttacagatcc 
23040 ctcttggaaa tcaggtaaaa aatgggaggt ataggtaatc agaatgtcat aattaagctt 23100 gatttaatag ataacgcaag attatatgta ataaacagag gatatatgtt cttttcaaat 23160 acccatctat tttaaaattg gttatgagac cacacataca tacacacacg tgctaaaaac 23220 caaaaacatg caaaactcaa gaagttaaca aaagtagaga attgtacagc ctacattcca 23280 ttctctggca accaggcagt aagactataa attaaaaaag aaagctcccc aaaatgtagc 23340 tatgcagaaa tttaatatca ctcttctaaa aagtctttaa aaaggaaaca tataaattac 23400 ttagcaacaa gtcacactga aagcaataca ttatcaaaaa tcagaatata accaaagcaa 23460 tatttgattg aaatattcaa agcataatta aaattgtttg tataataact ctttaagaaa 23520 tgtgtttgtt ataagacaac caaatagaaa tgaaccaagc attaatctta aaacactaga 23580 ggaggctggg cacggtggcg cacgcctgtc atcccagcac tttgggaggc cgaggcgggc 23640 agatcacgag gtcaggagat cgagaccatc ctggctaaca cggtgaaacc gtatctgtac 23700 taaaaataca aaaaattagc cgggcgaggt ggtgggcgcc tgtagtccca gctactccgg 23760 aggctgaggc aggagaatgg cgtgaaccgg ggaggcagag cttgcagtga accaagattg 23820 caccactgta ctccagcctg ggtgacagag cgagactcca tctcaaaaaa aaaaagagtt 23880 agtgtaactt tgaaactaaa attggataag gccaacataa aaagagagag aactaaagat 23940 gaacattact caagtttata aatttaaaaa tcttaaaatg ttaggaagtc aaattcaaca 24000 ggatctttaa aaacagttta ttcttagaac acaaggatga tttaaaatcc aaccgattat 24060 attaataaag agaaagccag tgtgattgat catcttgaaa agttgaaaag catttaataa 24120 gatttaacct gcatttttaa cattcttaca tcccttctag ccgtcaaact ctgatcgact 24180 tacacttaag actgtgtctt tggtagccta ccctattcca cttttattac aaaggcagct 24240 cttaagacat aaatccatct cttcaggctc gaactgcaaa acacacgtcg agaggtgaca 24300 atgttgggac tgtcaaaaag aacttaaaga gtgggataac cagccgggcg cggtggctca 24360 ctcctgtaat cccagcactt tgggaggctg aggcgggtgg atcacaacat cagaagatcg 24420 agaccagccc ggccaacatg gtgaaacccc atctctacta aaaatacaaa aattagctgg 24480 atgtggtggc acgcacccgt agtcccagct actcaggagg ctgggacaga aaaattgctt 24540 gaacccagga ggtggaggtt gcagtgagct gagattgtgc cactgtactc cagcctggca 24600 acagagcaag actccatctc aaaaaaaaaa aacaaaaaaa aagagtggga taaccagtaa 24660 gatctaatat gttgggtttt aatggagatg tttaattgaa 
gcatatgttt caaaattctg 24720 attattacat actaaatatt ttgtaaaata aaaatgtaat agaatatgat aagaatatat 24780 taatgacagc aaattaacaa gagtaggaat taagtgctat ttgcaatcaa agtcacaaag 24840 cttttcttaa ttataaaaca atgtgtttgc cagataagca gtctaccaaa tgcagtttac 24900 aaactccaac ttaatctgca tacattcaag attcgccttc tcctcctcac ctctttgatt 24960 tattctcctc cacactgatg gggctctgag agtggaattc cttattagtt aatcatgttc 25020 tcttctcacg gcagttcctt ggggggttat actatgaaag tgggctagat ggattctctg 25080 agactcctgt ttgacccaag cacatgtggc gtataggctg tagctcccat gagacatgca 25140 gtgagataag actggtaaga gggagcaggg atgtagctaa cccggtagct ttaacaaaaa 25200 ccacatttct gaaagggaag gagctgtgcc cagagaaatc agctgatgac tcaggatcac 25260 agtagtgatt cctgagacag gagtcatatt attacagagg aacatagctg gaactggatg 25320 gaccaagttt cacatcacca catcaccatt tactagctgt gtgaccttgg gcaaataaat 25380 ctgaatctca gtttcatcat ctgtaaaatg gagcaaataa taactaacct tctggggtta 25440 ttgtgacatg agcacatggc atgtgtgtta atgaagtctg ttcttaggac agaggctgtt 25500 ttattttcca ttctgcttta agctgcttat tggatattgg catgtgggtg atagccacag 25560 ggagggtaga ggcctgcaaa tgaatgtttc ccctgagtga ggcacaggtg catgaagtgg 25620 tattctatcc aacaggaaaa ttaagtgtag tcataaggta ggtggcagaa ggaaggcagc 25680 attaagactc aacccatctc atctcgttct ccaacacacc cggtacccag ctgtggtaaa 25740 cagtgctagt atttgtcttc ctccatattc tttcttgcca agtgcctttc caacctcctc 25800 ccacgtctcc ttcctcccca gatattgtgt gccttcctgt tgttgcccag gttcccaaag 25860 tcctatcttt ctaagaaggg acctgttttt ctgaattatt ctcttcctct ttctatataa 25920 tcaaatctct aatttgtccc atagtgaacg gtacagtctt gaaaatggca taaggggttc 25980 taaatcgcca ggtacccctg ctgctcatcc aacctcatct cctgtttgtc ctctcactgt 26040 gcctcagcca cactgacgtt cttactcttc ctgggacaca caagcctgcg tcagcagttc 26100 cttctgctgg gatgctcttc tccaaagtat ctccgtggct cgtcccctca ctccttcaag 26160 tctttgttca gccgtaaact tctcaatcat gcccatccat tctcaccaca ttgtaactgc 26220 tacccacttc ttctaagcat tccaattccc tcttactctg ttcttgtttt cccctcatag 26280 cacatatcac cctttactac tatataattt gcttctttat aatgtttgtt acttatgttc 26340 tgtcatcctc caccaccata 
tcaactaaca aggtcaggaa cctgtgtctt tcctctttac 26400 tgacatatcc caagctgctg aagcaacagg tgctcagtca atgtttcagt gaatttacta 26460 gaactataag tatgaagagt ctcctatatt atgtcttagg tagacaagaa atagataagg 26520 aagaggaaag atacagcaat caaggtagaa aatgactaga ggaacaggca tggttttaaa 26580 aagagatcca ctgctatgat atcaagacct gagtggactg gaatgaggtg acctggattc 26640 ttgctttaag tttaccaatg accttagcca ggtcacttaa gctctggggc ctcagacgtt 26700 ctttatctac aaaatgagaa aactaaacta ggtgaccttt tgggtcttat cctactaccc 26760 aattctgtgt ttgggtttta ctttaggaag accaattaac acttacggaa acattgcctg 26820 gtactaactg gcatcaattt cacctaatgt ctcctgaact ccccatctcc caccaaaggt 26880 cctgggctgg gatgatgtct tttagtctat ttccatcttt tcctcaaggg ccttatgttc 26940 gtataggtct cttaaaactc aacccaagaa gctaaactgt ggactggggt ttcagaagcc 27000 agggctttgc tcctccatcg taaaatctga gtaaacccac attgttttat ttttcgcatc 27060 tctaaaattg agatacagca atgggtcggc agataatatt agacagaggg gaataaggtg 27120 catttgatgg agctctgcta gaaatagaag cacatcaggt gccatttcag gaaattacac 27180 gggaatttgg ctcattatcc atcttgctaa gctgccacat cgtaaagtca cagactatta 27240 atagaatacc attgataact cagagtgtgc aggtagagtt cctccatgtt gattgtcgta 27300 cacattgtct ccagtttctg aagtagccat gatttgtttg tgttttaagg tacactgttt 27360 tcaaagaaca gcctttaaag ctcttttgaa ttcaaagatc tttcagttac tcaaggaatg 27420 ggtccagttt tcagattctc cacgccatgg gtgcatcttc ccttcagctt tcttcatttt 27480 ggtcattgct ccagatgttc attgtttaac acaagtgggt agggagacgg agaagaaggg 27540 aatttcaggc tcatattttg gtgtcatggg cagatcttgg gtgaggtttt cattgttagt 27600 taaactaagt ttgaaggttt acactgaatt tctttcccta ccgtcctctt tggctactta 27660 aatagatcat caggactgtt tgttcttctg ttgaaagtgc cccacactgc cctgggcata 27720 agctaaattc tttctctccc ttttagaaga gaagccagat tgctccaagg cccgctgtga 27780 agtccagttc tctccacgtt gtcctgaaga ttctgttctg atcgagggtt atgctcctcc 27840 tggggagtgc tgtcccttac ccagccgctg cgtgtgcaac cccgcaggct gtctgcgcaa 27900 agtctgccag ccgggaaacc tgaacatact agtgtcaaaa gcctcaggga agccgggaga 27960 gtgctgtgac ctctatgagt gcaaaccagg tatgcacgag ctctgtctca gcagccttgt 28020 
tcctttgcat cagagggtag cagatccctc ctcagccacc cctggcctct cctttcacac 28080 gaagatgggc cttctcaccg gtgtcctgtg aatctgccac tttgggctaa tggcagcttg 28140 acttctggcc ttctagtcag agaaccagag tgatcatcca ctggtacata agcgccttcg 28200 gagagcgagc gtgtcttttc ctcttttctt ctttttttcc ttcaaagtat gcttgactgg 28260 ggaggtaggg ttaaaattta tggtactatg gcaatctgct tcatttcccc agagaaggtc 28320 agagacctta tttgcagcta tatctgccac tttgaattat tacatttaaa aacaatcatc 28380 atacgtaaaa cagaaagtta tttttagatg aatacatgca tcaaaacgcc agaagtttca 28440 taatgcaaca ctcagcccca atctttctct ttgccaattc ccccgcatct gccctcttta 28500 ctttatagca tgtgccaagt catctttgac tgtttactca ccctctttgc caacatttga 28560 tgtactgttg tatcccagct gtttcttgct ttatattatt tctaacatcc ttgctcttgt 28620 ggcagcattt ccgatgtggt ctgtgaacac ttctgagtta tagcgatttc ctctttgcaa 28680 gtttctaatc atcggccagt tgtttttttg tttgggtttt tctttttctt tctttctttt 28740 ttttttttaa ctgctagata aacgctccaa gcaaagtgat cctttgtccg tctcttgaac 28800 tattctatca atcagaaatg cctctggcta actgtcacta tccatgtaac cactgctggt 28860 ttaattcttt ctctgtcttc attgtagctc ttaaatatct catccccacc tctttcctta 28920 cacagaaaat ttgaggaata ttgcagttct atagcattaa ctggagtaac atgtcgacac 28980 taggactggg tttgtggaag caaggagact ttggacattt gtcaagagct agtatttcat 29040 ttagggactc caaaagcaag taaatataac aataaacggt gcctctctgt ttgccccttt 29100 cagttttcgg cgtggactgc aggactgtgg aatgccctcc tgttcagcag accgcgtgtc 29160 ccccggacag ctatgaaact caagtcagac taactgcaga tggttgctgt actttgccaa 29220 caaggttagt ttgccattag tttgtcaagt tttctcctca tttgttagca tcatctaagg 29280 tacattttga gcatgcagtt tgttttccca tgagaaacct agtcctttcc tgtttctctg 29340 gaagaaatca ctgaactgtt tgcattactt ggtatttata tgtatctttc ctcagatcaa 29400 aaccaggtgt catcaggtta agtcgttaaa ccttatagat tctttctgta agggtgcctg 29460 tacacttttg agggctggga cctatataag caaatatgcc cctctgtata tagataggga 29520 ggcaggcagt gaggtgtcta tcaagagtag acacatcatt tgatcctctg cagtgaatct 29580 tgtaggtgct taaacaacat gttcaatgtg atgaaacatt gggacccttg atgtttggaa 29640 tcacacatct acccagaccc atttcttgct gaggtcaaga ggagtcttaa 
ttcttacaat 29700 gctttctgga ataacaaaca catctctctt tggcttgact gcggggaaga gccactgtca 29760 ttttaaccct agtagcaggg catgcactta gccccctaga actagttttt ctctagaaca 29820 tgaggatctc cgggtcctgt cctttttagt ctacaaagca ccaccataga gctgtccctc 29880 cttcgtactt ggctttgggg aagactgttc ttggaaatgt agggtttaca gtctgttttc 29940 ccccgactac tcacccctct tctctgccat gacctccctt ccatcttttg caggtcaaca 30000 agccatcttt atgactaagt catttcatgt ttttcagctt tcgttctgcg tttaatagga 30060 tgagttaaaa ctgtgcctta tttaactcaa tttccaacct cttcttcctc agtcccagtg 30120 ctaggcattt agcttctggg tttgttgcat gagtacctgt gattatgcaa atggttggat 30180 aagataggag tagatgtggt ccataggtaa ggtaagtatc agttatctgt ccgtaattga 30240 agagacatcc agaatttgtt ctgataatta ctttttcctt ctttcttctc ataaaataag 30300 ttaaatcctt ggcatctcct agcctttgtt ctgtcaaacc tagtattctc taagacatat 30360 cctttatact gtattataaa acagatgcaa agatttttgt gaagataaca tacattggtg 30420 cagccacctt cctatgacat tagaatatga taatttatgt gcagatgctt aggaaggggt 30480 gaaatatctt gcaaacataa ggtattacta atcagtaaag gagataaatg gagcatcgct 30540 aacagtaaca gatgtccctt cagggtcttc ctgaaaagga aggtggtaca ttatattgga 30600 gaacctaatt aatcaaaact aagccggttg ttaacgaggt agtcctttag tgaggaaaat 30660 ggacaattct cagttcacta ttataaatgc cacatagagc catctgcaca taggaggcac 30720 ggtcagctat tttttcagtc actctagggg gttgaaatgg gtagaagcta caggaaggta 30780 tatttagtct caatttaaga agaaactttc ccacaactag tgatgttctg taatgaattg 30840 gctgcttcct gaggcagaca gagctccttg gagctggata gatttctttc cgtctggatg 30900 gccacctgcc cggaatgctt acaagaattg tattaggttg gaaaggaggt tgaactagct 30960 gccttccatt ttatcctcaa gcttctgtga gtctagaaaa taaaacttga agatgattat 31020 tactgtgttg acataatttt gacttcagca ttcaaaaatg aagaatggcc agagttaaat 31080 ttgcctctaa tgaagacatt gaaaacatag agaaggaagt cattcaacat cccactgtaa 31140 attatttgca gagaagggat aggttctcac agctctttag tttcctgtat gtcagatcat 31200 cagtgaccca tactcatccc ctgctcattt gttcgctttg aacagggtct cctcctccag 31260 ctctctcacg agtcctgtgc ctttgcagtt gccactacct cttcccggac tcccttctgc 31320 atacatatag gcagcttcta cctctccttc 
aagactgcat ttcctgctct ttgaagtgtt 31380 gtcactcttc ggcccctgaa gatgacttaa gtgtccctct ctgctccctt gtcatccatc 31440 ccctgtgtat ctttttatca tagcactttt cataacacat tgaacatact gatgtttttt 31500 tgtctgtctc tcctgtcctg tgtgattgtc aaggaaaggg agtattttgt agcctcttgt 31560 acctaaatat aatgtgtcag cccacatctg ttggtgaatg aatggatgaa caaacaggcc 31620 ctctcttagc caagatgatt tatatgttga agaaaagaaa aagaatgaat atttttagtt 31680 tctagttacc ttcctgccat tgggttgtaa tactactccc tgtctcattt tcaaatcctt 31740 gcacttttcc cccagtcccc actttatggt gcggtactcc tcaatcctgt agaaaccctc 31800 cattccaagc acagcagtca tctctcacag catgggatgc ctttcactct ccattccatc 31860 tcttttcata ttgctcatcc agcaggactc taattaattt ctcctctgtc cacgtggtct 31920 tcatgcttca accaacaggt cccactgcct agaaacttcc cacataatcc tttcctatgc 31980 tgtcagttaa ttatcgactt tcatacattg ttactgaatt gtcgaatgcg tgaagtcttg 32040 tcttatcagc tatacgatgt tcctaaaggt caggaattgt gtcatctttt cttttcctct 32100 gtatgcatta gtgctgaatg agtcggtaaa tgcagaatat actcaagctg tggcttagac 32160 ccattcagcc tagtttagca tattcagtgg attcctgctt actaaggagg gtgatgggat 32220 gataggttgt acattgatta gagaataaca acccaaataa tcatgtgaaa atggaaaaaa 32280 aaaagcctaa catgtaataa taatggtagt attttaagag aaaaactacg taactttatt 32340 cttgtgtgtg agccatgtca aatgtccact ctttaagagc atatttaggc ataaagaaga 32400 agattataga acatataaga tcctattatg attatttgac ttttaccagc atacttataa 32460 gagcattgtt aagaaattag agccatttga gagcaaagct gattatgtgg agaaatcagg 32520 ggagaaagct atttacattt taggacataa taaggtaaaa tatctgaaac tttagctgta 32580 aagaattaca ttcagtgaaa gtaacaactc cctttttata agaatttatt gcactttcat 32640 gggctagaaa gtgggtttgt ggacttttct cccctatctt gaagaataca ttcctccctc 32700 cttcattccc atccaaatgg aattgccctt ccctatgaga ctatttcacc cattcgggcc 32760 cttaactgcc ccatttctgt gtgagcttga tttgttagtt ttctctggaa ttgtatgttt 32820 tcagttgatt cattcatcta ttcatcaaac atgaattgag ttcttaagag tgacagacac 32880 tattctccgg atgttcttac tgtgcatggg tttcactttc cacttctatc cctcacaaat 32940 actggttttc ccttctacct gtggcctttg tgagtatagg ctcttcctta tttccttgct 33000 gcctttctgt 
cttgtttaat ggcctttcat tttcacttct tttgtgtctt ttctcttgaa 33060 cttcatcctg tttaaacatc attcttgtgt gtgatgtaat taatttttaa ccattatttg 33120 attcttcctt ccctcaactt ggcatagatt tccaggcatc ccgccgtttc tttttatcct 33180 tttttcttct gttggcatga agacatgata tgagttatat ggcagtgcat tctacctttc 33240 aggagagggt ggagagggat aagaattagt ggctctggga cttatactgg gcctaaacac 33300 tacacagtga caaaaaggga aaatgggact gtctgtgagg tttaggacac cttgtggtcc 33360 ttgaaagtcc ctttgtgagt gtctgcatgt gtgtgcatgt acagatgctg ctgtgcataa 33420 gagaatttac aacagaaact taaaagacag atacattaat gcaaattatt tctttgcagg 33480 gaagaaaaaa accaagatag tctttaagtg acatatagat aacaatagca gtcacacctt 33540 aaatgtgtaa agtgccttgc tgtttataaa gcattttccc tattcatatc ttaccatcac 33600 tttaattcta tgaggaattt ataggtatga gaatggagat tgagataata agagacttgc 33660 agaaaatcac aaattttcct tggtacaaga gacagagcag gttcaaggct tcaacccagg 33720 ttttttaatt ttttttttat tgaaaaccct gtgctctttc cactggagct tttttgataa 33780 caatgttttg ttgtttttaa gtccaagaaa tatatcgggg aatatgaatt tatatctcac 33840 caacacattt ctatctgcat accacatacg tgtttcagtt attcactgct gtgtaacaac 33900 cgccccaaaa gaagtagctg aaaatgacaa agatttattc tttgtcctga ttctgtgtgt 33960 tgactgagca gttcttctgt agccagctgg caggttgact gagactggat gtccctccca 34020 tgtttggaaa ctcagctggg atggctggaa agacccctct ccctacatga tgtcagaagc 34080 aggagaggaa gttgcaaagc ctttggagac caagagttgg aagttacaga acacttctgc 34140 catgttctgt tggtcaaagg aagtcccagg tcaagccctc attcaaaggt ttggaaagta 34200 ggttccactt cttgatgaga taagccacaa agaatttgtg gccattttta atccacccta 34260 atctacattt aaccaagata gatgcccttt agttttcatt ttctgattgc aggaatttgg 34320 tggtgtgttt cagaaacgtg attcttttcc aaaaactcat gcccttctgg ccctggaact 34380 ctgggtcttc aggtgaaatt cgggcagcat tatagttaag ccccctgtgt gcccagcagc 34440 tctttttctg cttctttctt cacttgccat ttggcagtcc acccaggagc agacacgtga 34500 tttaagagcc tctgagccac cgcttcagct ttaatatcat ttagccaacc attattttta 34560 accttccgct tcttgaagag tgctgtgacc acctaggtta aactaggttt tctcccatca 34620 tacaaaaggg aatgctactt tgggatacca gaaaatctgc ctgataaaac cgggtccatt 
34680 tttaaaacaa aacaaagctt gcaaagatgt tgaatagaga ttctcctctg taatggttga 34740 gcttacctga gagaaatgga ttcaccagag gcagatgcat ttgaaggatg gatccagcct 34800 gggctataca tctgagagtc catgttctcc tggtgtccac cagcccagac ctcagcccaa 34860 ctgggctatt tctcaagaag atagagagtt ctaagaccac agtgttttct ctgcacttcc 34920 tactccttgc tctctccagt gtttggacct aattttacag actcttcact ttgcctccgt 34980 aatttttcac cagtgtttca gatacaggct tacacggtag acattaacta gttaaaaatc 35040 tagacagtaa tacctccacc ttactcactt tcactgtaag tcacgagttt tttaaatacc 35100 ctacgccttt tgataagtaa ctttgtacct ttttattatt gacaagtcac ctattttaaa 35160 tttattttta tgaaatttgt cccaaacttt ttgttctaaa agttctgtta tcaatttgtt 35220 ttataaatat gccgttctta aaagttattg agaaaagtcc aatccagcta ttttaaggta 35280 gaaataaatt aagtcattgt ttataatgtc caaaaactta aaatagagtt tcttctgggt 35340 tgtgtgcatt tcagaaagcc agattcccaa gctccctcct ggctcctgcc tctctctgac 35400 caagggcctg atactcttcc agaacctata tctggaacat gtgtatctct tgtgttctga 35460 gacactagac caaaaatcct attttatttt taaatcaatc ttcagaagta tttttttgac 35520 ttgttttttt ttttttcctt agcttcaagg taattaaaaa tgttagatta tctgtcagct 35580 aatatgttga ctaatatgtg acttctgttt tcaccaaaaa atgtaataga atatgttgga 35640 cacatgaaat tctaagtaca atttcactag aaatattgac aagccagaat cacagagctc 35700 aacacaggag gcaggctgta tcactctcct gtttcataga gggagcattg tggagctgga 35760 agggtcctag ccggtgcaca tcagcggaac aggtctgtct ggcacttacc actgagctgc 35820 actgtccctt gtggccctgg gctcactcac caccaggccc tctccagtcc tgcctctcag 35880 gagccctaac attctgtgtc atgcactttg ttattgtctg taagcagaca catacctgtg 35940 agtaaaccat tccgtagcta aagaatactc ctctgcacat cacctggtgc tgttcagaaa 36000 gggcacgaat catgtgtcca gagctagaat ccaggcccac ttcggcatga gtttgggtga 36060 ggctctcaca ttgtctggct cggttttcca gtcgaagaaa ggaggggagg aaggatttga 36120 ttgtctcaaa agtaacttct acctctaaat tctttgactt tccaaaagcc tttagaaggg 36180 aaggaaccca agccgctagc gacaggagac tcctgggaaa caaaggtaaa gaaaaatttt 36240 aggaaatttc agagctcaga aaataatggg agagatatat ggctcgccta gggataagaa 36300 aaggagaaag tccaggggaa gttataaaag aattatgggt 
tttttcttta aactttctta 36360 aaaaaacaaa acaaaaaaaa accttagaaa cagcaaaatg aaataaacta ctagagtaat 36420 gatttaagcc agggtaatca gaactggttg gattgtgaaa gctgagtaac aattcagtaa 36480 aatagattac gtatttggtc agtggttagt tgactattgt tagttaacct tttttttttc 36540 tttacatcta taaatagata gtccctacgt agcagaagca gagaatatct ggccaaaggc 36600 tgactccgca agataaggat ttactctgtt aaactgcttt ctaatagaat gtcgatatct 36660 ctgcctgtca gcctgccttg gtaaatgtgc tggagagggc catgtgtgta tggttcctgg 36720 tgcccacaca ggctcacctg tgtgaaactg catcagccca tacatggcta cttgctggct 36780 cctggaccca tatttgcact tgtttctccc cttctgttgc cacatcactt tccctgaggt 36840 cggattgtgt gtgtgtcttc tctgtcttca gctgctccgt attcatattg ttcttttctg 36900 tgttttacat ttaatcgtgc atacacacac agaataatat tcatggacaa gttttaggtt 36960 ctcttttcct aactttatga ttggaccatt tctttctgtc tcttaaaatg aaaaaagatc 37020 tactcaaagt gacttttttc cttgtttgaa cctcagcttt atttctctcc cataaaacat 37080 tgaccttaga aggcaatttc ccactcatat cctgtcctat catcacagct ccaaactctg 37140 taactccagg aaaggaaata agataaaagg aaggatggta gtcccattgt gtctgaaaaa 37200 tatgtcaaat atacagccca tgaccagagc gaaggtcatc aatgtcgttt gctttcacta 37260 taacatctaa atatttgcac caagctgctt caatttgaaa ttgtgctctc aagaaaattc 37320 attttgttgg tgctgtcttg cattatgata atagaatctt agcagattgc tagcacactg 37380 tttcttgtga atagatagag atgtttagaa gaagttttct ggggttccag aattcacagc 37440 ctgaaattga ctcatactgg atgaggccag ccgaatggaa tctttggcag gttatccctt 37500 cccttcagaa gcacttgagt taaatccaat taatctaacc acatagctac agtgagtgaa 37560 aacaaggtac agattttaaa atgctccaat cggttaaaac agttaaataa ttacatactc 37620 atcgaggacc tcatggaagg cagaacactg ggcttggtgt ggggtatagt ggtgaataaa 37680 atgtggcaca tgctttaaag agtcttaaca tcttacagat atgaccaatg ccagtttcaa 37740 aaaggctccc aagttttgga aggtaataag aaggaaacac acaacatgca cacacacaca 37800 cagtcatctg ccctcaagga aagaatcaat taatagacag tctttctacc aaacaatgaa 37860 aaccaaattt aagaaaaacc atacagtaat atgccagctt taaatccaaa ctgagaaatc 37920 aaggacctca aaaagcacat atacatatgg aacaacagat gacaattttt tcttatccct 37980 cccttccctc ctgaaaactc 
aataggacag gagggatttt cttattcctc tgttctggaa 38040 catctgtctg cttctgagaa atggaacgag cagcaggatc aagaatcctt tagtgctctg 38100 tgtgctcaaa tgcttatcct tccgtggtag agctttccaa agtttgtcaa ctcttatctc 38160 cagagctaca gcccataact ctgcggtttg aagcctttag gagggtgttg ggaggtggga 38220 ggatcacgca agagtcagag gctgtcaaca cagtgcgcat caacccaacc tgacaaatat 38280 gtcaaagcca gaaacttcct gagggtggct tggaactgtc agggtcatgt tcagagctca 38340 gaaagcactt ctttttttct tcttggagac agttttaagt ttttttaggc catcaaataa 38400 tatttcctgt agatccatgt gctttttacc attgtgctag gcaattaagt tgcttatatc 38460 atgcctactc aagctaacca gttttaacta tccctgcata tgcattttca aagggacttt 38520 atctaattaa gagtgttgta attcattatc atgtccatct gttctaagat agtgtctcag 38580 agaagatggc acttaacagt ttgggttttt tttttttaag attgatctgt tagtagacaa 38640 atttccataa ttcagttcat ttttgttgtt gtcgttccct ttaatttgaa taatttctcc 38700 agctgctttt tgggagtaaa taaaattaag ttatataata agtaaaatat gcattagcac 38760 agcacagatt taaaacattc ctgactcctg acatactatt atggtagcat tctctctgtt 38820 tttaaacaca tgcagaaaaa aatttggaag tgtatttgaa gctcatcttt ctcaactaag 38880 aaaagaacat ctattctgtt gccctctgtc acagtaattt ttggcccaaa agagcgagtc 38940 aactcatgac taaaaaagcc agagtctata ttaaactctg atgtttttaa aacaaaataa 39000 ttgtaggaat tccatgttat tattggacct atccccagat tgaagaaaat atgctttcct 39060 agaagtaaag ccattgccca ttggaaatga actgtctccc tagtagagtg tttgatttgt 39120 caataattct cttcagatgc gtgcgtgctg tgtttcctgt actagggcct taggcttttt 39180 ttatctcacc ttaaatttga cctagtgtct agccttattt tgtatttgtc ttagtcccga 39240 aaacaagcag ggaaggctct cagtcagtgg cttccagttt aatccgagtg gtcatggata 39300 tcaccagaca ctgtatatgt ggccctccga gtcggactga gagaaggtaa cttaactgaa 39360 ttccgtaagc ctttcagtaa agttgaaagc ggaatccaca gcgtgataat attgaaactc 39420 agaggtctct ccataccctc ttttgctatg tcatcatcat ggcacatgca aacagaacag 39480 ctctgcaaaa atgctttgtg aaattacttg catagctaat aagcaagtct aaaatgtaaa 39540 agccctacct gtaatgaaaa taccaaaagc aacccaaatc acggaatata ttttactctt 39600 catgtacacc tttacattgg atatggctta ttgggttccc catcattgta gttgcaacag 39660 
tttgcagata attcagggtt atatttggta gacatttaat tgctaattag aacccaggag 39720 gtagagacag ctggtttctc ttggccttgg tcagtctgtt gtgaatgtat tatacagcaa 39780 gtggtttgta aattattctt tcctgacttt gcttttgatt gaaaagaaag gaagttatat 39840 gtcatggctc ttatgataat atgatcaggg actttgattt ccatagctag atcatgaaga 39900 gttttaatga taattcctaa gacattagat tgtggattct ttgagggcaa ggacccaccc 39960 tatatatttc attctcatca gtacctagca cgggtaagta atgaatgctt ttaactaaat 40020 tagatagaca attagtctgc agcaaaagtc ctagcatcaa aaccccatag tgttcttgga 40080 ttgttcaggg agcagatatt ttgagcattt gtcatgtgcc aggcattggg ctacccagca 40140 ggaataaaat atagtcgctg atgtctaaga gcatatatta taaagggaag gcaaatatgt 40200 ataagtcatt gtagtttaaa gtgttgtgtg ctttggtagt agaagtgtgc aaaaaacaca 40260 aagttagcaa agcaaacaca tgctaattcc tatttggaga acggtcagat ttgacttttg 40320 aactgagtct ttaagaatga gtcagagttc acaagatgga gaaagagaga agggcattct 40380 aggctgttgt tgatttttcg acttgtaatg tgtcttgcag cttaaaacat taacactaat 40440 cccacctacc tttatgattc aagaaaaagg gaattcatga caatgttggg ttttcttgag 40500 ataaccactc cattgcaaga ctgcattact tgtttaaggt gttttgttac tttcctgttt 40560 cttacaagtg aagacgcttc ttagtatggt aggcaaggcc ccatattact cagccctgac 40620 ctccctctct ggccatgact cctgccagta cccccataat aaccccagtc ctcctctcag 40680 tacatagtgt acttctggtg ggctaaccta ttagtttaaa cttaacacac tgtattcttt 40740 cttgcctttg taaatttagt gtgttctttc ctctatctgg aatgccttcc ctctgtgccc 40800 actcccccac atttgatggt catttcacaa actgccactc acacccctca ccaccaccac 40860 acacaaacac acacacacac atacagcatc agctgtttat ctttgtgctg ctagttagag 40920 aaggcatcat ggaagaagtg atcgtcagga atgaaccgag gtctctgagg catgatgggg 40980 ggcttcaggg tagaagatgc agagtagctc tgacttttct ctagctgctc ttaacagctt 41040 gagtatcagc acagaggaag tcaaagggtt gaatcacagt ttgcagttag tagttcttgg 41100 ggctgaattt taccccaaga catgttttct ttaatgtgcc cctttttttc tatttttaaa 41160 ttagtcactg atatttaaaa actggcatac ttcacgtaaa aatctggctt ctcttgagta 41220 catggaaggt atgccctcat tgagtttacc ctttggcatg gaatgagtag tgatgccctg 41280 ttagaggtca cagaccttct tgaggtgtct agagtccccc tctactcttc 
taggattcct 41340 gcctgattca ggtttgccat ctctgggatt aagtaagttg ggtttgggag acagatacag 41400 aaggaggata aagaggcagg aaaggtaaag gcgttaggaa gatggtggtt caagtgaaac 41460 accgtgggtt ttgggctccg ttgttaaaga ggtacagtca ggaggaggtt caggccgcag 41520 agaaaaggaa acgatctggg gagtagatgt ctcagtgact ttgaagtgtc agtgtggtag 41580 agcatgggcc agggagaact ggaaggatgg accattgtga tcagcagata ggatgtgtga 41640 cttcagagag ttcagagagc tgtggctgca ggactgagtc accgaagtag agggggcaac 41700 aagggtcact ggattccaaa agatcaagtg aggccagagt attgaaaata ctttgtccac 41760 atagactctc aagcctctca gattataatc atataaaaca gctaacaggt gttgtgtgct 41820 tctgtgttcc aggcactatc ctgggtgctt ttcatcagtt cgtttaatcc acatagcagt 41880 catatgatat gtgggtactt taatcatccc cactgtacat aggctacaat tgaggcttag 41940 agagattact taacttactc aaggtcacat agttcctaag cggccagaac ttgaacctaa 42000 gactgtgttg ttgccaaagt gcattcccca aacctctgcc ctctaaagcg ttggatagga 42060 gacagagtca ccggaaggga aggtttgcat gagagaggaa agcaaccatg tgcaagtagc 42120 cagtggtcaa cttggagaat gctgacctcc cttccatccc aggtgaataa gtgagtagga 42180 tgtggggatt ctcagtctca ctggagatgg ctataaggaa atagaatctt taggggagag 42240 ccaggtaaat gttagaaaca aattcgagtg gtgcattgtg tgatatgttc ggaggatggg 42300 ggagtttgtc tgcagaggaa agtttggagg aagcagggga gtcagttgga agggatgagt 42360 cccagagcag tgctctctgg ggatgtgcat tccttggagg atgagagaca atacggtctt 42420 tcactttgca ctgatgacaa ggatgagagg ctgggaattg actgcccaaa ggaatccaag 42480 tggatgcatg cttaaagtca tctgtttcag cactagctca ggccccagat ggttctggcg 42540 gcctgaagga gtgttagcag ccaggcaatg gaggctttgc tcagattttc aagtagaagg 42600 ttagtccaga catcgttggt gtttgttctt tgtgtgctct ggggagagga tttaatagtc 42660 aacctcccat tgtgttctca tttgtggtta tatataaaag cttccctaaa ctaaattccc 42720 taaacttagg gccagaaccc tgtgtatccc cagccgcaat ccagcttagt acctgctgca 42780 tgtacacact cggtaaacag ctacaaaatc attacgtgtt cataaacaag ctgattcatt 42840 gttccaaagc atctattttc tgaggcagtt ggaaattcca ctctgaagtg ggaataacct 42900 atgttgctaa actaaaactt attctccact tcccagccca gttctccttt ccacagcgtg 42960 atgtgccagt ttcccttcct ccttttccaa 
actagtcctt catgcagctc tcactcccat 43020 cctgtcttac tctctaaagc catcctacca gccgtgaaag cagatgccat tgtcttcccc 43080 tagacctacg gaagtgaagg cagggagcct tccgttcctg ggacttggcc agctctcgat 43140 gtgtgatatc ggattgttga gctggactga gtggcccagg atttctgcca gatcccagct 43200 gtaacatttc aacaggggca cgggccctgc gattactttc agctgtgttt ctgctgtgtc 43260 tgtctccacc cccccaccac ccaccgctgg ggttgagtcc cttcaaaact aaccctgatt 43320 tttttttttt ttagccactc acgtcaaggt agtaatttgt gaaagatgcc gttgttttct 43380 ccccaccaca cctactttct gtggggcaga gtcctagaag ataattgtta aattcttata 43440 ggaaccacgt agacttacaa cttctgttct cctgcagctt ataccacttt cctttcaact 43500 tagcacctta agataagctt ccggagccac atgccagcct cgaggaggaa gctaggaaga 43560 gaagagctta atggctgctt tattattatt accagtagta ggaaggcaag ggaatatttt 43620 attttatttt tatttatttt ataaattttt tatttccata ggtttttggg aaacaggtag 43680 tatttggtta cataagttct ttagcggtga tttgtgagat tttggggcac ccatcacctg 43740 aacagtatac actgaaccca aacctaattg tctttaactt ataccttccc tacatgtgag 43800 catgtccttt gattcctccc cccagaaaaa aacaaagcgt aaaacaacaa caacgaaagg 43860 cagagtgatt ttttctggtg gtcttcaaag ggaaagtgtt gaagaaggta cagtttctgc 43920 tcactgtaca caggggaggg atggctgcgg ggggagagtg tgatccccag ggcagggctg 43980 cacttgctgt ggctacagca catctgtcag tgttaggggg tgtaaatgcc aggaaatcac 44040 gcaggaataa gcttgagaca ggggagaaag ggtgaaggag ggttagttat tcatacacag 44100 atagtagtaa atctcttatg gtgaggcacg aggaaaactg tgagcctaag gaactttcat 44160 agggaggagt gctgttttcc ataagaaggg ttggaccctc agataggaaa cctgtttctt 44220 tttggatatt ttatttgcag cctttaaaaa aaaaaaaact tacacattcc tggaaagagt 44280 aagaaattta atcagtagaa agatagtaca tcgaatttaa ttaaagttgc agctactttt 44340 tatagaatag cactgtgttt tatatgtcat tataatttcc attcccgaca agctatttca 44400 ttaacacact gtgtagaatc ctagtaacac aagggctttg ccctttttgc tcctcattta 44460 tccttgtttc ctgactaaag ggcgacctcc cacccacctg taagtagttt ttaatttata 44520 ggcggtcatg tctccacttt tgaaggcctc aagcactcac ttccggtgtg tagtagatcc 44580 agtattgaaa gaaatctttt ccagaagacc tcactgactt gttttcctct aggatacggg 44640 cagaggaaga 
tcagagtctg ctggggaaga tgaggaatat gatggcttgc tgcttccgag 44700 ctgcctcctc tgcagaagga gcctactatg gggaatgtta ggataacatc agcacccaca 44760 ccaggaggat acctctgggg tgacggggcg aggcagaggt tgaggggcat tcctgttctt 44820 tgctgaaagg gctgtgggag gggcaaacag gactcctagc cctgtggcct cctccaggaa 44880 tctcctgtag gttatcttcc tttaaggttt ttctggtcat tgtagtaaaa caaactgttg 44940 acccttgcgc ttagttatgt agggtctgaa acatgcctag tggtagggag acaacagcat 45000 ctagcatgtt gtttcttttc cggccttgct tagagttctt tagtagggaa aaaaaagtta 45060 tataattatt atgcccccac gaaaaacaaa acgtcaccag tgcaaaggcc ctatcttgag 45120 ctgggcctga aggttgaggt gcttggagag gaggaggatc aggggagtaa gggacacaag 45180 gtgactgatg ttcctgaagg acagggtgca ggtgagtgag aagaggatga gctgaggtca 45240 gtttgcagca gtgacgggaa tgggaatggc agtgatagaa tcaaaggatt ttatatttag 45300 agaagagttt attatgatcc acctgtctca ctttatggat gagaaaacag tctgcaagag 45360 tttaagtagc ttgctagcag tcacacattt ttcttctttg ggtgacagta ttggataaat 45420 tggtaatatt tctgttcttt tcttccctcg tggtatctct cacttttaag caatggtatt 45480 caggcaggaa ggagtctttg taagtgatct aaaacaagta ctagtaatca ttgttctcat 45540 tataatgtta aaactagttg tacttgtttt ttgctttttt gttttttttt agtggaaagt 45600 acaaattaaa ctgtgaccac ccaggctttg cgcatctagg cacaaagttc ttggaaagtg 45660 gattgcaaat tcccatcagc agactctcct ggagagccta ttatatgggc agattctgga 45720 gccgctcccc aaacctgcag aaaaacaatg tttgagtggg taccctggca tctgcatttt 45780 aaacacattc catagtcttt ccttccacat actaattttg aaaagcactg tacctagaag 45840 cttgcagtct aaaatgtata aaaataaaaa atataaaaca atacatggtc taacccctga 45900 cttagtatcc atgaaaaagt ttatttcaga ttattggttg ggtaccttta tcattttttt 45960 actattattt ataagacaag taagacatgc cccctgcact agaggaattc acaggagaaa 46020 cagtctgatt ttgatttagt atgatgggcc gatgcacttc aagagttact accgttacgt 46080 actgttgcaa gtcagaggag ggaaaattaa tcctgtttgt tttttgacca gagaaggtct 46140 tacagggtag atggtcttaa ggttgtgggg aattcaccag gggcaaggga gaggcaggga 46200 tttgaggctg ggtggacatg agctgtggct ggagtgataa ggggtggtgg caggtgattg 46260 agtgacatca gatcatcatt gttgctggag tgtagggtgt gtgcaggaac cagtagaata 
46320 tggaaactaa taacaggtag aggcagactt aggagcacct tgcaagctat gcccaggagc 46380 tcggactctg accttctggc ctgcctggga ttgtctgtga agtccaggaa gcaaaggaca 46440 tgaagggaaa gcttaaatgt gtactcaccg cccttcagtc agcagaacct cctaaggggg 46500 tgagtgccct ctaggggttt ccatgatttc tgagttccat tgtactcagg cactcaggtg 46560 cccactgagc ttgaatccat tacctacttg agaaaaaaaa aagacagctt taccaagtcc 46620 cacctggctg cagctgtggt tacattatgc aaaaatttat taatgtggta cctcgattga 46680 tcattatatt aactctaagg attacattag aggctttatt tttctggtta cttgcaaata 46740 caggttgcat cttctttgaa gttaaaaagc ccttagccca catattgtat gattccattt 46800 atatgaacta tccagaattg gcaagtccat agagacagaa cataaattag tggttgctaa 46860 gggccggaga tcaggggaaa gggagtgact gctagtgagt acaggatttc tttttgagat 46920 gatgaaaatg ttctgaaatt aggtactagt gatggttata caaccttgtg agtataaaac 46980 cactgaattg tgtactttaa aagggtgaat acttatcgta agtgaattat atctcaataa 47040 agctgttttg tttcattttt taaaacctta gcacttctgc tcaaggtcat gcaaatagct 47100 ctgcctgggg tggcctttat gaattcacag tagagaaatc tcagtgctct cctcactttc 47160 ttttgtgttt ttgttttgtt gtgttactgt tattcctgtc ttgttcctct gcagttcact 47220 tccacttaag aaacatttat atgactagac atgataggta ctagaagagc ttatgttaag 47280 gtgagtcctc aagttaccac aaaatagaga aagagggtgc tgtaagaaaa gtgtggtgtt 47340 tgggggttca gagaagaagc tgttatccga ggttggccta gaaagacaac tgggcataag 47400 ggctggatga ggaatgtgca gaggccatta gggcaggcag tgcagagaca gcaaagtgcg 47460 gacttatttg gagaaaagag catgaccgta gcaatggaaa atgaagggaa atactgggaa 47520 acagaattga gaaagggtta tatggacttc agagtgttga atgctaggct gagaaagtga 47580 tacataaaat gaggagtaat ggaagataaa ttttgactgc agaagtttgg gaactgagcc 47640 ttaggatggg tgtagcatgg tttgggagag acagagacgt gggcaggggg agattggact 47700 agaactggaa attggggaga gggcttcaga aatgaggggt aagtgttgaa atcagctcac 47760 tgcagctttt ctcatgaaat gctgtttgct ctgccccttt tgtgttctcc tttcctttgc 47820 ttgctcctta tccagagatg gggataaagg gagggaaaga ttagggagga caaaaagaat 47880 taaaatgaag agaaagaatg tggtgaggac ggtacaggtt gcatgcgtca gttcttctgt 47940 agttcagcat ttcctgttca tgactcagta gaggaaaatc 
tagtctcttc tagtggaaaa 48000 cacagagaac aggaaaccct atttaaaatc aggtcacttg actaagctgt gtaactcctg 48060 ttcccatcgg tgaaagggaa gaatacctcc tgcagtgtgt gttaaccaga gaatctagaa 48120 taatcttact gaagaatttt ttaaaactac atgggactca tatcatttgt agatataaca 48180 gacttcctgg gcaaatatgg gtcatgatgg ggtccccttt tgttacctga cagagtgaat 48240 gtaaacatac ctgttaaaag ataactcatg tgagttgtat tatgttgaaa ataaggaatt 48300 gctatttgac catttcgaac tgcaaaactg gcagtttcag taaggtttta acctaataga 48360 tagctaccca atagatatct gacctaagcc cagattttta ccttttgtgg ttctttcatc 48420 agctctttct tgattaagat gctaatttca cgttcagggg aagctgtgaa aagtcaaaag 48480 atggcaacag gtagatcaga aggctttgca gggaaggaaa caaggagaca ggagctgggg 48540 tcacaccagc tagcaaaacc aggatcttta ttatgcagtc agtttagttg taaagattca 48600 gcataaaaca ttttcctttc ctaggtgggt taataggtgg ttgggtgcaa cttcagctct 48660 tctgtagcct gctggagaca acttgtaagg gactttaaaa aaaaaatagg aatatttaaa 48720 tgttaggaag acgtaatgca tataatgttt actctttggg tgacaggtac actaaaagct 48780 cacacttcac ctctatgcaa tatatgtcta taagaaactt gcacttgtac tccctaaatg 48840 tattttttaa attgtaaatt ttttaaacaa atgttgggaa aataggtcaa acataaaagg 48900 ggaaaaaaga cttcatggca gacttagtca tggtgttcag tgtggtgtga ctcctaacag 48960 catagcagag aacccagcac gtaagtgcac gtgttcagta aaggttgttc aagtgttgaa 49020 ggacaacatc tgaagaggta ttttttggtt gactgtacac tctcaggtgt ttagagtaga 49080 atgattccct ggagcacgtg tgtaggcaag aatgtggaat catagaagat ttgtaagaga 49140 aggttaatca gctattcttg cgggatgctt tcattgtaat cacatctaca cattggggcc 49200 agggtgtggc ctgtcagtat tctagcccca tttatctata aactccagag agttttaagt 49260 tgtacagagt gccaagtaca ataaatgtaa acaatataat gttactgtta tatcacagca 49320 gtatgattat gtgcttacca ctgcaatggc tagggatata agcgtgacaa agtatgggat 49380 gaccttatca ggcactaaaa gcccagctca catttactca gcgctagaaa atgattgtcc 49440 tgcaggaata tagcgtatga attgagagct ccaagctcta ctcacacaaa acaggcaaac 49500 ctttacttga aaccactggt agaaatacaa ttgcagcaaa acctgaatga cctaacaccc 49560 tcttctccca ggaatccaat agaactgcca ccccacatgg aagaatcctc aaggagccta 49620 ccttggggtg atagtggggt 
gtttaaagcc acctttccag tggctttaat agtccttgat 49680 gcaaatcata acctaaagta atttcttttc tagttccatg gttaactata aatgtaatta 49740 aatctctggg cctcattatt gaagtacttg tggactataa gcatatttta atttattaaa 49800 gatccaaaga acaataagtg aacaaaccaa ttacttaaat aagaaggggg gacaaaagca 49860 caattaatga atgtattgat agttttccct gctgggaaca ttacctttcc tgatcttgac 49920 ctctaccgta tgtaaatacc cacatacttc ctgccaggat gtcttttttt aggatgtctt 49980 ccttttactc ctgtcaacct 50000 22 50000 DNA Homo sapiens 22 gggaactccc gccgtcatct tctccaccct ctgacccata catctctccc caggggagtc 60 ccagggcaga aatctagcgg ctctttcctc atcttacatt cacatgctga cacccagctg 120 ccctaagttt cctgtgtcct gcctctcttt cattgtagca gatgcaacat tttgtctcct 180 cttatttcac ctgcccgtat ttgccaggct tctgcgttta gcaatttttg cagcagcatc 240 gaagttaggg gtgagagcca gcacagccag cttgcagctc atggaacacc cccattcatg 300 cgtgcagaga gatcgatgct caggtgggag cttaagatcc aaaaaattaa gactttcagc 360 ttccaagata gctgtcataa agttcactta ctacagctta ggccctctct ttataaagag 420 ctcctcagtc ttcagtccgt ttcttttatt gtgcacagtc ataaattaaa tattgctcaa 480 gatgagagtg gacttaagac aatcacactg acaagaaaat actaaaaagc taataactta 540 atcattgaac caggagtcag aaaatctctg aaaaacccta aacatcagtt gcagcaaagg 600 agcaacatga ggccatttag cacgggaaca ctcactctgt attagcctgt gcaagcatcg 660 attcacagaa gagcataaca gatacatatg aagatgtccc tgagtcagcc agaaaatgtt 720 tgttacgctc aataccagtc tggttaagta agaaatgagg taatgcattt tagaaatatt 780 tctgtcaaaa tggttaaagt taaaaattgg ggtgtacata aaggagacaa gccttagtta 840 tttgaaagca aattatggta gaacccctcc tgccttccca tttcccagtg acaccagata 900 aaggaggcag tgtcgtatag accaggtact ttgccacagc cactgccact ttgtttgtgg 960 tctcccgact tctatgttgt tgtttattca tggggttttc cttccctttt ctttggtttc 1020 agatgcgagt gtctctctgg cttatgtggt ttccccgtgt gtgaggtggg atccactccc 1080 cgcatagtct ctcgtggcga tgggacacct ggaaagtgct gtgatgtctt tgaatgtgtt 1140 aatggtacgt ggggtttctc ttgttctcag agagtgtaca tttgtgcaga ggaggaggag 1200 ggagggttca acttagctgc ttctgccttt tacaaagcag ggattacctt caggggcttg 1260 ctgaagggca ctcaaaaagg tttgcttaag atccacagtg cagtaaagaa 
agttaggtta 1320 ggaactatca ttccacccat cagaaggagg attctggaaa tgtaatagca gattgcatgt 1380 agctgtcata ctcagctgtg ttacagaatt tccaaaactt agggatttca ggaaaaggat 1440 cctggagcct gtgagagttt tcctgctgtc actcagtaac tggtcccatg gctgtcggga 1500 ggtgacctgg ctaaaactgc attgggcctg ccacacttgg gaacaagaaa aacatggctc 1560 atcttgttta gcattttcct acttagaaca tctttgcacc tgatcttagc caaaaggctg 1620 agaggcggtg gggaagacat ctttaatcag tggttggtca ctgatagcgg cctgctgtag 1680 gcttgaccca cgtggccctg aaaaaggatt cttgggtcac gagattcctt ctgacagaat 1740 ctgcctctcc ctttcacttt atatgggagg atttggtgtt tggtggcttg atgttgattg 1800 agtttaggtg ctagaaggaa ggggaacctg caacatgctt ctctaccaga atattcatta 1860 acaaaaaata aaataaacaa gttacatgtc aataaccatt cagagcatta tttagtcaat 1920 acttgtatac acaggaagat ggagatggat ttggacaaat acaacaaaat tacacataga 1980 aatattcata tcaaatactt cctgtaggaa ttcagatgtc ataaggtctc ctgccctccc 2040 aaggcaagaa aggttatgtt gcagttgagt cctatttggt tgcatcctgt gatttttagg 2100 cccgtggcca agttgttttt tattgtttat ttttgctttg aagaaacact tgcagtatgg 2160 ctttatggtt aagaaagaac acagactaga attaataacc tcagtttgaa aattgggtcc 2220 tccactcact agctgagtta actttgagga agcttcttgg tttccctggg cctcagtttc 2280 tacatctcta aaatggactg aataacagta aattaaacaa ggtattatag gtgaactacc 2340 ttgaattttg cctcagatgt aaataatctt tagtattctt tttttttttt ttgagacaga 2400 gtctcgccct gtcgcccagg ctggagtgca gtggcgcaat ctcggctcac tgcaagctct 2460 gcctcccagg ttcacgccat tctcctacct cagcctccca agtagctggg actacaggcg 2520 cccgccacca cacgcggcta atttttttgt atttttagta gagacagggt ttcactgtgt 2580 taaccaggat ggtcttgatc tcctgccctc gtgatccacc cgcctcggcc tcccaaggta 2640 ctgggattac acgtgtgagc cactgcttcc ggccaataat cttctgtgtg cacactttct 2700 attcacatcc agaggcacag ttctggacat tggggatggg ggatagagat ggtaaggttg 2760 aacttcctga cttacagacc ttataaatac aagcatacag gaaaggggaa cagaaaaagt 2820 tctgtgcact taaatcacct ctcggtgctt ctccccactc tggtcattat ttagccagta 2880 atcacatcca ttaaacacaa gcaaatggcc aagcagacct tttttctgga ggtgctacag 2940 tttgatttat gataggtgag attctgtgaa tacttgtacc actggagaca gaaatagtca 
3000 ctgcttccct gggttcccag catctcagat atgcttcatc cagtgctcat cacttggaaa 3060 agacttaacg ctcatgcctg tctctcactc caaaatgtga actgctggag gacaggagtc 3120 tgtcacattc atccttgtgt ctgtagttgt tagcctatta tctgcctcac agaggacacc 3180 cagattttgt tgcatgagcc agtgaatgat ctaattcgta tcatcttagt tccctctaga 3240 ggtgtctgtt aaatgtcagc ccataaatgg gttactttgt aggaatcctc catctattta 3300 ctgccctaca ctttaatctt aagtcatttg tcccctgaaa atatcatctc tagtagaaag 3360 atttgctggt gtttcctatg ggtgaggaac cggagtgccc ttcttctctc cttgcttctt 3420 ggttctagat acccatgggt cattcctttc ttatcagaac atagacttta tgtcttgctc 3480 cagtctgatg tgtctcaaac tactaatagc cgtacctcca ccttagcaac cttgggatca 3540 gtctgcaggg cctgtcctca tttattgcca cttattatag agccctgact tctggaatag 3600 gatttgccat ctgtcctgag aagatgttga gtggacaaaa cacagtcact acatctctgg 3660 tcgttgcttc tttcttgcca ccataaactt gttgagtgat ctttgctttc ttcctaatcc 3720 tgtggaactg tgtctattga tatatcagtc attcacctgt agaggagact aggaactaat 3780 actgtgaaga ttcataatcc tacagaatat tgttaagaac attgtctcaa agcctgcatt 3840 tttcttcttt tgatatatac attcatactc atacacacac atatatattg atagatgaat 3900 gtgtctgtgt atgtatatat aaatatactt accacctagt gtcctggaaa aatctgttgt 3960 tgtaaggtca ccattagaag atactacata tcaaagattc attatcatca cttagaaaag 4020 cagaggtcta ttttatatgt ataaaaccac tcgcttatac acagaggatc tgaggaaact 4080 ttacagtgag aacagattct ataagatagt gataaaatat ttatgagaaa ggagactcac 4140 agccatagaa ttgtaagtaa aagttgaaag tgaaaaatca aacagaaagc acccacgggt 4200 atacagtcca tcagtgccta tgcagttgtg ggagttaaat gttaaattag gcccccagtt 4260 ttctggccat caaagtgaaa aaggagatag ggtcaaatac gctgttctta ttttcaaaga 4320 ggaaaaaacc tgctggatgg gacaactgtc tctctaatag agatttttca tatggaaagg 4380 aatggatttc atcccactga atgatgaagt tgatcatgtg cttaattgcg ttaacaccac 4440 aaatgcagaa actggtttcc tgggattgtt ctcaataaaa agcaaaggca aaatattaaa 4500 atacaactca gttccaaaag gtcagattga cacagctaga ccgtaggtgc ttggaacatt 4560 gtttctcaat tttgcaaact gcccaaccat ggtgaggtag atccaaagac cccagttcag 4620 aagactgtgg gtagaatgtt aacatgactg accccacatg cagaaatgga ttatgcactg 4680 
ccgcactcac ttcctatgct gccatccatg tgatccatat gcttatgttt ttttcttttt 4740 aaaatcaact ttaaagtgaa aacatacaat tttaaaatac acagaactcg aaaactcgtg 4800 agttttgaaa acactagcct gatgagtaga tgcagagaga gcatcctcca taaatattac 4860 tttgtttttg tagtttgatt ttattttgtt gtggcggagg gatgggagag ttgcagacca 4920 gttcatgaca gtttgtaact gcattctctg acaagcgccg aaaggagaac ttatgaggct 4980 ttggctgaac aagatccttg ttatttagaa accaaaggaa caggtggtat gattatgttt 5040 cttttatgtg aacttagaag tcatcaaaat catcacctag aaatatgacc tccaaagcag 5100 tgttgtaaca gccctccacc ttcgtcaaga acccactgta tgtctgctcc tgcccacaaa 5160 caaatgtccc atcagccgtc agctggagct ggacaatggc ctctactgtt agatgggtgg 5220 atgctctcca cttaacacag taccccctgc tgtcttacac atggcccacc tctcgttcct 5280 ggaggcattt gggtttgaaa tcctaggtgt agagtgtgca agtgattatt tggtttctta 5340 gatacgagca tttgtccaac gtatatacac aaacatctag ataaaatgga atatcttggt 5400 tggaaagtcg tagccaaaag ctgactctgt ctgtatgacc tacttataaa acgttcccca 5460 agtgagtaaa tgaattagtc tatagtattg ttattggata gaagccagtt gaaactacca 5520 gaaatatttt tagccttacg gtatttaata ctgaaccatt tgttctgaca gcatagttag 5580 cagggccaga tatatattga ggctagcccc cattcacagc tctaaacaag tcatcagcac 5640 aaaagaggaa gagtaccctc ttcccagtgg tgccaagagt ggtgggtgga tggggtgggg 5700 atgggagcat tacttgtgta tggcacaact agccaaaagc aacttcaggg ggttatgaag 5760 taagttcaga gaaaggggat ctgaatacaa cttagcgtgt gtttttagcc tgttttgtgc 5820 catttgaaat acttctgttt tggagcactg tccctgtagc actctgcttc tcacagcagt 5880 attaaaatgc agcactaaca gtgaggagcc tggatgaagg tcatgttgcc cacccagtaa 5940 ttagttcctg ttttattcag gcatatacga aaggctaata aaaactggaa gtaatcttat 6000 gttggccctg acagaatttt ctctaagaca gtaacatgtg aaatcaccat ttaaagggga 6060 gccgtaccca gcatcagctt gctcctcatg taaaattttg gcctagaaaa ttcagacctg 6120 caggctgtta attttcaaaa aaaaacacta gtacagaaaa cattaagaac ccactgtcta 6180 tatagcaatt atgctcataa tgtttctaag gacgtaatgt cttctactga tagaataatg 6240 gagatattct ggcactttga gagcaatact ccactcagta tgtgtttatt gagcacccct 6300 cgtgttcctg gggccttgtt aagctaattt tgttaaagag tgatacacat tgttgtaatc 6360 ctttctaagg 
gttgttctct tccccttttc ctgaaatagg atatgcttgg tgtcctccag 6420 cacacacaca cacttcccct cttctccctg tggaataagc caattgaaac tagcaaacat 6480 atttttagtc ttatagtatt taataccgag ccattaatta tgagagtata gttagcaaga 6540 ccagatattt attgaggcta gtccacattc acaactttaa gaacatgtca tcagtataat 6600 taataaaggg aaaacttaag gatttaaaaa cacacagaac tcaagaactc ctgagttttg 6660 aaaacactag cctgaatgaa cagatggaga gaacatcctc catgaatagt ccttctcttt 6720 tcactttggt tttaatttgt tttggtggag ggatgggaca agctgcatgt cgaagctgca 6780 gagagttatc tagaagattt agctatgaaa agtgatgaag gtggctacac taaacaacag 6840 atattcagtg tagaggaagc agccttctat ggaaagaaga tgccacctgg gactttcata 6900 acaagagaga agcagtcaat gcctggcttc aaagaatagg ctgattctta ttaggggcta 6960 atgcagctga taactttaag ctaaacccca cggctgttaa gaattatgct aaatctactc 7020 tgcctgtgct ttataaatgg aacaacaaag cctggatgac agcatggttt attgattatt 7080 ttaagcccac cattgagacc tactactcag aaaaagagat ttcttccaaa atattactgc 7140 tcattgacaa tgcacctggt ctcccaagag ctctgatgga ggtgtacaag attagtgttg 7200 ttttcgtgcc taacacaaca tccattctgc agcccatgga ccaaggagtc atttcaactt 7260 tcaagtcttg ttaattaaga aaaacatttc ataagattag ctgctgtaga tagtcattcc 7320 tctgatggat cagaactaaa tcaattgaaa accttctgca aaggattcac tcaccattct 7380 agatgccatt aagaacacag gtgattcagg ggaaggtgtc agaatattaa catggagttt 7440 ggaagaagtt gattccaacc ctcatgaatg actttgagga gctgaagacc tcagtggaag 7500 aagtaactgc agatgtagta gaaggagcaa gagaactaga attaaatgtg gaacctgcag 7560 aatgtgactg aattgctgca atctcatgat aaaactggaa aggatgagga gtttcttctt 7620 atcgatgagc aaagaaagta gtttcttgag atagaatcta ctcctggtga ggatgctgtg 7680 acaacatctt caccagataa aacgacaacc agggatttag actattacat aaatttagtt 7740 gataaagcaa cagcagggtt ttagaggatt gactccagtt ttgaaacaag ttctactgta 7800 agtaaaatgc tatcaaacag catcacatgc tacaaagaaa tctttcatga aaggaagagt 7860 tagtcaatgt ggcaaacttc attattgtgt tatttaaaga aattgccaca gccaccccaa 7920 ccttcagcaa ccaccatcct tatcagtcag cagccatcag catcaaggca ataccctcca 7980 ccagcagaaa gattactatt tactgaaggc tcagatgatc gttagcagtt tttagcaata 8040 aagtatttta aattaaggaa 
tgtatattgt tttttagaca taatgctatt gtatacttaa 8100 tagactatag caggggtgtc caatcttttg gcttccctgg gccacattgg aagaagaaga 8160 ttgtcttggg ccacatgtaa aatacactaa cactaatgat agctgattag cttaaacaca 8220 cacacacaca cacacacaca cacacacacc atcttacaat gttttaagaa agtttgcatt 8280 caaagccatc ctaggcctca tgtggcccgc aggctgcagg ttagacaagt atggactaca 8340 gtacagtgta aacataactt ttaaatgcac tgggaagcca aaaagtttgt gtgacttgct 8400 tgagctggtc tggaactgaa cccacaatat ctttgaggtg tatctatcac catccaccaa 8460 tcagtgtagt ttggtaggca gtagcatgta ctaatagtct atattacaaa attctctttt 8520 aggagttctg aagtatattc aagaaggaag ggaagatccc agaacttaga aacttaaagt 8580 ttaattttgc agacgcaaaa cacatacaaa aaatcactaa aaaggcactt ctgtgcttca 8640 tattgagaag tacaaattac acagtaaatt atttcttcat aacacttttc cacatccact 8700 taaatcatcc ttttgtggag attgtggcaa tctccacact gttctgagat tcacatctac 8760 ccctgatagg tgaagaagta attggtcatt gctcctcaaa aaccacaggt agccttttta 8820 tcatatactt ttcttaagag cctctataaa tctctttgaa catcaaacag atttcattgt 8880 gttgtgtgac ttttttcacc cttaaagatt gattgtgttt ccaggctgtg tttttaggat 8940 tctccatctt tctcacttcc actcccttga tccttgtgct tttgctagtg aaacctaagt 9000 gccacagttc tctcgtcagt cccatcccct cctgcttaag cttctgcaca ccctaccata 9060 aaaacaagtc cacttgaagc agctttattc ataaccagaa cctgccctcc acacttcctc 9120 agatcatgca ccagggttgt ggcttcaccc agatggtagc tgtggttgat caactcaacc 9180 aaataattag tattgattgg tttatttttt ttagtttcca tggatgttca ttttgttaga 9240 gtacagtatt ttgcctcgct ctacagcctg caaatttaat gtataaaagt actccgtttt 9300 ctatactttt tggaacttat agtgtagctg cacaattcct ttcaatccca actcagatta 9360 gtttgttagc taaaagtata tagaagaatt ctgagccatg gtaagatgat tcttttttaa 9420 cattaatttt ctgaaaaaat gttgtatcca tgtagcacta aggaaggagt ttaaacattc 9480 acatgtagta cgtgtgcaca cataaaatca ctgcctcatt catgtgactt taagcaaatc 9540 acgtcatctc tttaagtttc atcttcatta tcgaaaaaac tggtactaat aatactcgtt 9600 ctgtcttctt agcggggttg ttataaaata aatataaagt aattgtcctt gagtacaatg 9660 caaatgcaaa tatcctccaa atgcggtgaa agtattttag acatgcaagg ggacatatta 9720 aatgatttgg ggatggacag aagagccctt 
gaaagatggg gatatcatca tccagctttc 9780 accggcagat tgagcagggc tcatcacctc aggagaactt tcttcagctt gggagtactt 9840 tttttaaaaa tagctttagg agacaccatg caatttacca acttaagttg tgcaattctg 9900 ttgttttcag tgtatgctca gagttctgca atcatcaaca caatcagttt cagaacattt 9960 tcatcaagcc atcagaaaac tcctgaactg attgcagtca ctgcccattt tccccctaag 10020 ctctcagatc taggcaacca ctaatctact ttctatctct atggacgtgc ctatgatggg 10080 tgtttcatat aagtagagtt atataatatt aataccttgt cttttgtgac tggcttcttt 10140 cacttagcat aatattttca agtgtcattc atgttatatt gcatatcaga acttcatttc 10200 tttttatggc cacgtaatat tttattgtat agatatacta ctttttgttt atatatcagt 10260 taatggatat ttaggttcgt tctttcttgg ctcttgggct gctaggaaca tttgtttata 10320 gtgctttttt gtgggtatat attttcattt ctcttgggta tatacctagg agtagagttg 10380 cttgatgata tgataactca aattaaggaa ctcacaaact gttttccaaa gcagctgtgc 10440 ttttacatcc tcactaaaaa tctatgagaa ttccagtttc tccacatcct catcagtgct 10500 cgttactatt ttttaattat atccatcata gtgggtgtga aatggtgttc attgcagctt 10560 agatttgaat tccccgagtg actaatgatt cgagcatctt ttcatgtgtt tattgtccat 10620 ttctgtctta tttgaagaag tgtctgttca aaggtttgcc cgttttgatt agattatttt 10680 tatcgtttaa ttatgggagt tctcggtacc agttttttta aatcagatat atgattgcaa 10740 atactttctc ccaatctgta gggttgtctt tttacctttt tgatggtgtc cttttaagtc 10800 ccaaaattga tgaagtccag tttatctgtt ttgttccttt gttgcttatg ctttgaatgt 10860 cacatctaag aaaccactcc ctaatcaaag gccacaaaga ttaactctta tgcttccttc 10920 taagagcact gtaggtgtag ctgttgtatt tagatcataa ctctatgatt cattttgagt 10980 taatttttgt atatggtgtg aggcataggg ttttcttctt tattcttttg catgtagata 11040 atgcaaaaga ctttcaacct caggcctggc atggtggctc acacctgtca tcctagcact 11100 ttgggaggcc aaggtggttg gatcacttga ggtcagagga gttcaagacc agcctggcca 11160 acatggtctt tactaaaaat acaaaaatta gctgggcatg gtgtcatgca cctgtagtcc 11220 cagatacttg ggaggctgag gcaggagaat cacttgaacc caggaggtaa aggttgcagt 11280 gagctcagat gacgccactc cactccagcc tgggtgacag agtgagactc tatatcaaaa 11340 aaaaaaaaaa aaaaagactt tgaaccccac tgagtcttgg ccatcttgtc aaaatacctg 11400 taaatgtgat 
agtttattcc tgaactctca attccattga actatatatc tatccctgtg 11460 gcagtaccac atagtcttga ttactatagc attgtattat gttttgaaat taggaagtag 11520 gagttctcta actttgttct cccttttaag attgttttgg caattctggg tcccttgcat 11580 tttcatatga attttatgat cagcttatca atttctacaa aagagccagc taggagtttg 11640 atagagattg cattgaacat gtagatcaat ttggggaata tttccattta aacaatgttg 11700 tctttcaatc catgaacacg ggatgtcccg tttacttagg cctttttagt tttttgaaaa 11760 tgttttgtta ttttcaaaat acaagtttgt acttatgtta ttacatttat tcctaagtat 11820 tgtatacttt ttgatgctat tgtgattgga attgttttct tcattttctg attgttcatt 11880 gctagggtat acaaacacaa tttggggagt tgcaattgta ctctgcaacc ttgttgaact 11940 tgtttataca ttctgatagt tttttagtgg atcccttaaa actgtttaca tacaagatca 12000 tgtcatctgc aattagagat agttttactt cttcctttcc aatctggatg cttttaattt 12060 atttattttt cctaattgct ctggctagaa ttttcattac tatgtcgaat tgaagtggtg 12120 agaacacacc tcattgtctt gttcctgatc ttagggggaa agcacttgga cttttgccat 12180 taagtatgag gttagctgta ggacttttga ggatgccatt tatcaagttg aggaagttcc 12240 gttctattcc tagttggttg agcattttta tcatgaaaga atattagatt ctgtcagatg 12300 ccaaaaactg tgtctatcga atgatcatgt tgcttttttt ccttattgtg tgacatatcg 12360 tattaacatt aatattttgg atgttaaacc aaccttgcat tcttggaata agtctcactt 12420 ggtcatggtg tataattttg tttatatgtt gctggatgtg gtctgctagt aatttttttt 12480 aagatttttg tgtctacatt tataatatac tgatctgtag ttgtgttttc ttgtcttggt 12540 gggagttatt ttaaatcaac taaaattaaa gaatgaatgt agggagagtg tttcaaggca 12600 gagaaaaatt taaaaatcat tgatatcgtg atgagacttc taaatatggg actatgagtc 12660 ctcagcagaa cataaccctg aagcacaatg gaaggagcac aaggtttcag ttgaagagct 12720 actgttttgg cttctgtcgt ttgctgtgtg actttgagta aactacctag gatttcctca 12780 cttttaaact caggataata atacctatgt cagaccttca ttatagggat taagtgaggt 12840 gatacatata gagagtattt tgttaaaggc caaacactat taaaattata cgtgtttata 12900 ccatgtacag agaaaaagaa tttaagagag agcactgggg gaacagaaca tcaccacttc 12960 aaaaaagacc ataataaaat ttttgaagca gaacaaaacc taatgtggag gcaggatcat 13020 tcatttgagg tagacaatga tttaaccaaa caaatagaat cactttaaag cacataccag 
13080 atatgtgagt gtcatggtca taaagtttag ctttgtctgg gatggaagga agtcagtgta 13140 gattttccta gggggaattt ctgttccttg gtgtaaactg gataacagtg tgttttattt 13200 tattttgttt tgcttcactt tattgttttc tgttacatga acttacattg agttgagtaa 13260 atttccatca acttataacc tcctattaga aattttctag aggctgtttg aaaaacatca 13320 aaggacacaa ctaaaaaatg actacaaata tgccttgttt gttttaactc ttttcagata 13380 caaagccagc ctgcgtattt aacaatgtgg aatattatga tggagacatg tttcgaatgg 13440 acaactgtcg gttctgtcga tgccaagggg gcgttgccat ctgcttcacy gcccagtgtg 13500 gtgagataaa ctgcgagagg tactacgtgc ccgaaggaga gtgctgccca gtgtgtgaag 13560 gtaagaaaag gtgctaatta cagattaaca ggagcataca tggtatagag ttctaaaatc 13620 aaaatcgtcc taattcagtc tcatccaaaa gtaaaggaat attgaagttc cttttttagc 13680 cttttttaag acaccatggt ctgtctttta gcatgttaaa gccaggtaga gtgtgtggaa 13740 acacttccct aagggaagct atgcctgggt gtagacaatg ggtcacctct ttccacaggc 13800 aacgtgagat gtcccagagt tgttttccgt tcctcccctg cagttctagc aattcctcag 13860 caagtcctag ggcaaagctt tatcttcacc ctggcattag agctcaccac ctcctcacct 13920 cagattgctc agttaggata atgtgggcct caccctgaca gtctcagcct gccggcccag 13980 agaggaacta actctgccac tgtcgtaata cccaggaccc tccagggaga tcagggacct 14040 gctactgctt ctctgaaaat atgtggtggt tcacatgggg tcgtggtatc aggtcaccac 14100 ctggtgacac caactcaagg cccagccagg tgtcaggtgt gactctcagg aatctgcttt 14160 gtctggattc cccttttctc tccaagaagt aatatgttca tgttccttca acaggctgca 14220 tcttcccatc actgatgttc ataaacaata agcatttcaa aacagttgct ctgaatatca 14280 ctccctgcaa cccacacatt ttacagatta ggatgttgaa gttcagaaga gaaaagtcac 14340 ttgcctgtaa agtcacttta cactgcagcc agtaacaata accccgtgcc tgactgtgtc 14400 tatagtgttc ttctgacaac acaacactgt ttccccagaa aggaaggtct tgtgagcctt 14460 cctgtcctaa ccaggcccca gatcttcaat ctgaagtcgc atgtgttaat cttatgcctg 14520 atctctgcac acaaagtggg agggcacagt gtcatgccac tttccatact tcctcatttt 14580 cataaaattt tctccttttg atatttgagc aatatttaca acataatatt gaagatctga 14640 aaatgtttaa agtataaaaa ttgcattcct aacatagttc ataacagtga gttcttttcc 14700 ctactagagg ttattttacc caacatagga gaaaagtagt 
ttatttttta ggtgatgttg 14760 acaattaaaa agaattactt caaattttta acctttctaa ttgttttgtt ccctacctgc 14820 ccccagtctg caatagttct gaaactgtgc agcagttgat gctgtaaaaa tgtagatacg 14880 aacctctgtg tagtcatcat actgagttgt ctgggcatgg atccagttat gtctcacttc 14940 ctggaattca gatccaggct caacgtagaa aaatatgtat tctaatactt tttacacagg 15000 aatttaaaaa atcttctctt gctcaggtcc attggccgcc tccctctctg gcagactgcc 15060 ttcctaacgc tgtgatggca ttattttaaa tgctgacctt ctaactgact ttattataat 15120 gtcccactat gtccttagaa taatgatgac caaactctat gaactgtctg gattcctctc 15180 tcagcacagc agtagttgtt tgttgaagcc agttttttat gtatttctga gaaagaaaat 15240 tgagtaattt atctggttgc tctcaataga gggataagag agggtttcat atgcagtatt 15300 tatatacttg tcagtaacct cagaaacttt ttttttttta cagtagcttt ggctacttct 15360 gtctcccacc tggggctgca gcttagcttc ccaacttatg ggagaaaaaa tggcctaagt 15420 tcacagcgct cctccaaggc acagtgaccc atggtatgtc tagactagcc ttgtcatgaa 15480 caagtccaca gcaagaccac tgttgcccac atggtcgtgc aggtacatga gatagagttt 15540 tagaagtaac aatatcactc tgacttaggg cattgttgct tccgctttac agttagggaa 15600 gttgaggttt gataaacatt tatgtctaca actgcgaaga cattgtcaga gcctaaaatg 15660 gagtcatgtt ttctaattac caacccacag ctcttgccac aaagctttcc aactttacag 15720 agtgataaca cctgcacatt ggtttttaga ctacagatcc tcttctctgg gactgctgtg 15780 tcaggatgag tagacataaa ttcagtgtga cagcttcttt tgcatgcacc cccagagact 15840 cctcatgcaa ccctgcgctt ctgttgatca cagtgggaat agaactgctg atgactagaa 15900 cccagggaaa aaatgtctct gggtactgac agagataaga tttagaagat gctcagtcta 15960 actctgaact catgttctca tttagatcca gtgtatcctt ttaataatcc cgctggctgc 16020 tatgccaatg gcctgatcct tgcccacgga gaccggtggc gggaagacga ctgcacattc 16080 tgccagtgcg tcaacggtga acgccactgc gttgcgaccg tctgcggaca gacctgcaca 16140 aaccctgtga aagtgcctgg ggagtgttgc cctgtgtgcg aaggtaaatc ttgcagatgc 16200 taatgagtcc ctggtgaatg tcttctgttg gagaagcttc taccagcgta cgttcttgtt 16260 tgtttatttc cttgggcata atgtctgctt cacgctgtta ccagttgcca acaaatttat 16320 caaacagctc aaactgatga tttgaggaaa ggttgcacat gtgaaatgag aatagtgtgg 16380 tcattatttt gccattgcaa 
atatgcgcgt ctgaatactt tacagtataa gaatatgtac 16440 attttaatta agttcacttt gaacctaatt aacacccaaa ggtcagccta acaaaatgaa 16500 attgaaaatg agaccgatga gtcaatctta gcacctttga cttgtgatgc ttccatatgc 16560 ccaagcgtcc tggcaactag aagaaaagac tggaaccatg aaagccaatt tccactgcct 16620 agtgccgtca ccgagtatct gttctacttc tcatatacca atggctgtgg ctttgcaatt 16680 cctctagtgc tgttatttat gtcgtggtta cctgcatttg ctatctctct tcttgttcag 16740 tggagggaag cctgaggcat cctggactag tttttctgtc ccgactccta ttccagctca 16800 tttcctccat cctgggaaga ttctctgcat acttaactca tgtcagtctt taatcttgtg 16860 tccatgtgtg ttagtgcaga caagggcttg aatatttgta cagacatttc taatagaggg 16920 tatccagcat taagaagtac atttgcatag taaaatgaac taatgtggat caaaaattaa 16980 tccatgttga agatggatct ttctagaatt tctaaaaaca gatattgaaa aatggactta 17040 tctttccaga aagtcagtta agtagactaa gtcttttaag tgcgaagaaa tcttgtgctt 17100 ccaccgaaat gttaataaac tcagactcca gagtttgcct gactagcttc atcagcctct 17160 gaggggctag acttattaaa gaccttaaaa atcatataaa agtttaagct tgaggcctct 17220 ctttctgttg acatcaccga ttcccttgtc attactgtac tcagagtgca gctgccacac 17280 tccagaactt tacagcttcc agatgaccag ttttgtggaa gtcacccacc cctgcatgta 17340 tatatactcc tcccattccc taaaggcagc atttcagtgc tgaaaaaaaa agagcatgta 17400 gaaatacata tgatggttgt atagtcggcc ttacaggaaa gggtgcctta ctgaagcaga 17460 tgaagataag aatgtcagtc cctaagaatt agggacaatc tgttgccaac ttgaaatatt 17520 taattacgaa tatttgggac acacggtaag gaatatgaaa agcataaatt cataatccag 17580 taaacattag ccttttaacc tcatgggtct tatgtcataa attcaaaata aaactttaaa 17640 aacttgttga ataaaagagc ctttcctctg gaagtaaatg aaaggaaggc tgatcctttt 17700 gcacccacct tttgcattta tcagcatatc catcatgaag agtggagaag gaaggcagtt 17760 ggtgaacagg tagatatata tataactcat ttattaccat tttcttagca cacattgatg 17820 tttgggtaag gagtgaaact tagtgtggtc acttagaatt tggtctctgc aacatctgca 17880 tagagttgta atctaatcat ctactgtttg agaaaagagt tacagttttg gtgagccatt 17940 aatttgaaaa aattaagaac ctcctggaat tcagatacag cattggacac aaagataagt 18000 aagagataac ctttaacttt gaagagccta tttaggctga gttttgaatt aaaccagaag 18060 
aaagggaatg tccccgttga ggtatggtgg ggtaagtgca tcacatttgt acccaatttc 18120 atttaacctt cagagcaaca ctgctgaagc ctttattatc ccattttgga gatgaagaca 18180 ttcaggttta aggcgattta gatcatttcc cccaagggta tagttgccat tgtgtggctg 18240 agtgaagatc aacggaagtt tgtcagattc ggctccattt gtgcttgctt cccctgcccc 18300 acacagcatc ccagcctaac tcatgccctg cttctagtgt cctcacccat ctctactcac 18360 ctctggcttt tggctctgtt gaggagtgag ggcagtcagc ggaggcaagg gaggaaatca 18420 ttttgcagtt tgaaaactac tccaggtatt ttgaatgtga tcgtccacct actgtctcct 18480 cgcagtttga agacctcagg ggaaagatgc ctttgagatg catccctgag ttcacttcta 18540 gagcctcagg tctgggcagc cccctccacc tcaacagagc tcagcatcct tgtcatttca 18600 gcggtaaagt tctgctggcc ttttttcctc tctgtctctt tcatgctctg gctctgatct 18660 ctcatcaaac gtttcagaag attccatttt ctggttctca gaaataattt atcttgcttt 18720 tcttgttttt gtttttaact tagttgagta gtgagtttca tatgaatgta tttactctcc 18780 aataatgctt tcattttaag atgtattaat aataagaaca cagattattt tagtatagga 18840 ttaaaaaaca gtgcacagct taagataatg atactaaagg gttatttatg gctactaata 18900 ataaacttac aaaataacac aaggtgaatg gatcttactg cttagtttat agggtccaag 18960 aaagtctctg ggatcatttg aagacatttt ttctcagcag tcattcaata tcatacttaa 19020 atataatttc aaatgatttc agagtagcaa aaggattgta tgaaataaga aaaagtgcaa 19080 ggtgttaatg tgcaataaat aagcaatcac ccagaaatag ttagaggaaa accagaaaca 19140 ctggtattta cttagtttat acacacaata aattcctctg tgcacagttg cgtaaccagt 19200 gttggccctt accctattag ctacagtctc taggtacatc gtcagagtat gaaattagat 19260 gaaaggaccc cacacaagga ttgtggtttt agattctaag taaactaaca gcatcactaa 19320 atatggcgtg ttaggacctc cagggaggaa gaaccttatt gtggctgttg ttactggaga 19380 cacatacctt ccaaatgaca agataatgtg tcttttggtt actcctggag gactagggac 19440 atgaacctgg gggcccttaa agactaggct gttacttgac agctaacttg attcacaagt 19500 acctaagtgc tttttaaatt ggcacttaag tactctcatg atgaattctt agcaagttat 19560 tgtcaacacc caattcaggt atacagataa gcctttccag tctgttaact gattatattc 19620 tcacattttg ctttgttgac ttttctctac atctcttgtt tatcgatctc attattttta 19680 ttttctgacc cctgtgaaga gctagggtgg aaacgtgtgg aatagcatat 
aaagttttca 19740 gttgaatggc atacatagac gacacagcct taatctcctt aaaaactaaa aacaaaaaag 19800 agattggacg gcagaattag gtcttcatct gcagggccac tgttatttgt aacccttaat 19860 gacagtggct aacatttcat acaattgtta tgtgtcaggt ttaagtgctt taagaggagc 19920 aactaatttg agatcctcac agcatccctt cggtatgggt gctgtcatcc cagtttacca 19980 tcagagaaca caaagttcgg aggccatgta atttgcccag ggtcatacag ctcataagtg 20040 acagcacagt gtgtccccag gaccagaccc tggaccctga atggacagtg cctctgcgct 20100 gcagtgcatt ataccctgaa agaggctcca tccccctgta aagaaagaat ttggcatatt 20160 cttttgaatc cctaaaacaa catgtgatgt gtaagtagat gctcaaaatt tttaaatgag 20220 taaataataa gtgaggacag tgaggaggaa agaaagactg gagcaggaag tcccaaccag 20280 acccttcata cagatcttct ttttagttga gtgagggagc agaccctgat acagggttac 20340 cttaaccaga gcgtcaaaag ctgatggcca ttgggacccc tgatctacta gttgaaggaa 20400 gaataatgtc agcatgctta tcactgaaaa cacattttga gaacataagt aagagtgtta 20460 tggaagtgtg tgacatgagg tgggcctaca tgtctacagg ccagttggac cagtttcctg 20520 cattgacggt tcacccttct gatgggcact cagctctggg ctccatccct gtgtcagcct 20580 catgggcctt ccaagactga tgtgtattct gtggtttatt ctgtcccagt tcagcccctg 20640 aagtccagaa tcacagagtt taactggggc ttcccagacc ctctaaggct aatgcttgct 20700 gccacgttaa tgtactgttt ttaagacgcc ttgtgcatgg tgtaggtgcc actgcatttc 20760 agttacttac acgggcaagg atttttttta cattgtcatt agatattgag aattatttgc 20820 tgattttttt tttcaataat agcaagtata aatgctgcca aagctgattc ttaatgtcag 20880 agggatcctg aggagaggtt gccatttcca ttagaaatct attttgtgat tgaaggtaag 20940 ataaggtatc ggggagggaa gttttggcta catgacatct ataacatccc cgtgattact 21000 caataagaaa aattcaagag atgtggacag gtgtgtggcc agcttagctg tgacacctga 21060 tgcaagagat tagagcttcc tggggacttt gaagtcttga tgtcagacct gatctgcctt 21120 ctcatcttgg gctgcggctg gaccctgccc aggtggccac tgtgatctga gtgttcttca 21180 ggcctcctta ggtgcaccat ctcctgcacg gaggagctga tgattttgat gaccttaggg 21240 attttgaagc caacaccaga gtatcacttt tcaagaacta ggaacaactt aggtaatatt 21300 ttaaaaggta gcattcagaa ttatactcag gttgccagta aaaaaattcc gtgctagcgg 21360 ttgggcaaat gggtagtttc ccctctacta 
tttctgtctg tctgaaaaca aagcaaaatc 21420 catgttctat ctctaacagg aagtcaaaga gttatttgta gattgttaaa gctggatgag 21480 aggtatatca gagtccatag tattcagtat tttttaagtt ttcttcttaa taaaaagtta 21540 aaattgctat ttgcagacat attatcagat acccagctga gtttcaggcc taataattct 21600 gctaacctgg acttaactct tgaagtagag gatagaggaa taagagattc actgtgcatg 21660 gattccttct aattctgggt ttaggatgga atgtggttga accaaaaaat ccactatctt 21720 tcttcctgcc acctggcttt cttggatctc ttacttaggg aatcctggtc ctacttggga 21780 cagtgttgaa gactggatgt caggcacgtc ttgacctatg tggatttagt aagccacttc 21840 agtctcagct cacccaatac tgtaagttgt gctcctctaa aaggctggag ttctgtgagc 21900 tgtcaaatat attgtcgtct tcttcatcaa tcccaaattt atggaagcac tatattctct 21960 ccctgaaggc tgatgatata gcttgaatca agctgccctt ctaatgagat ttccactggg 22020 ctgccagaaa agtctgtacc aggcacatat ctgggtggga attctttgaa acaagaaaaa 22080 atgtttccag ccctgcccat aatgcttcct cactgaacag tatctcaaat gattaagcta 22140 ctgcttttgt ggcacggagt caaacatagt ttggctgacc ccaagaggca gtgaagtcaa 22200 agatgccata gggtcacatg ttattaacat tctggtacat aacagggatc aaaactcagt 22260 ttatgggtgg tattgcttca gtgccatata cgatgctatt attagtaagc caagaataat 22320 aaaatattcg taatagccta aagactgtta ggcatgtgtt ttatccttta actctaaacg 22380 tcctggaaac tagttccata aatggaaatt aaaatataac acagtctatt ttttgcttgt 22440 ccaaataagg tcatatgttg ggtaacaaat gcgtagcaaa tgactgtttc ttaacctagg 22500 tctttaacat aaacatcaga aatttttcaa tgttgaatat tgtcatattt tgatttttct 22560 ctgatcattt aaaatgtctt tacatatttg caataaaact agaatatagg taaagtaaat 22620 gcaaaagaca ctcacatcca gccttaatat atcactttca agattttgtg atttttcaca 22680 gctatataga aatatgtaca ttatatgagt ggctttcttt acatgttaga tccaccactt 22740 acattaatag ttaaacaagt gtagaaataa atgttgtaga agtagtttaa tagagaagaa 22800 atgtttattc ttacttatac ctaggagttc tgctgctggt tctccaagtc cagcatgtct 22860 ttgcttgtgt cggtaatgcc ttgctctttg tacttgattt tgagtaagac tgtcatactg 22920 aaccaagttt tacgagtaaa aatgaagatt ggtcttatac tcttaatttg ctagatgacc 22980 aaactgatca atgccttaaa tgttcagaat tgttgagagc taattctgaa acacatgatt 23040 agaaattcct 
ctgcccctga gtttacagag ctcagctgtt aagtatttcc ctgttgaagt 23100 ggcgtgcaag ggggtgggtt gctctttatt tcctctgctg tctacttccc atacctaaaa 23160 gggtattatt gctgggagtc tcataatttt cctactaaga caataattgc atctaataga 23220 tcagacttgg gcctgctaat tttcatctaa tcaacagagg caggaaaaca tagtctggaa 23280 tgctaactat gtgaatgaaa aatatttagt cattcttgaa atgataggaa tgtttttcca 23340 acaagcttaa ataaacacaa tctcttttga gactgtgcta aacttttttt ggatgagaca 23400 actcttaggc aaccaaacag aagttaatga aggttattaa gagacatgcc cttcagaaaa 23460 ggcctgcaga ctttaaaaag aaaagattcc atcttcttta tgctgcaacc attgataaat 23520 agcagaattg gtttactttt tagcttttgt ctacccagta gaatttaaca ttttaattag 23580 taatcatact gcccaatgtt caacagatac cattttacct aacatggcta tatgagacca 23640 tcttatttta aaatcctcat cttcatctcc tactgtttgc ctcagttcca aggaattcat 23700 tggactcaac actcctgtat ttatagcacc tgattacact tttcctatga agcatgtttg 23760 tttattccca ccaacacaag gtcccgccag ataataaaat tatttttaaa catactaaaa 23820 tgtacatatt caagtgaagt gtttttcctt tcacaagtta tatgggaaaa atgtatactt 23880 actccaggga tacagtcatt cagaatattt ttggtgcctc tcttttgaaa ctgactttag 23940 agatcatagc acatcctttt gaatatcctt agctttgtaa acccttgact ttgaaggttt 24000 attagactct aggaaatagc caaagtcatt tatgttgaag tctggtttaa aaaaaaaaaa 24060 aaggtggccg atctagctga gtaaaactgt ttagaggaag caaaaagtga cagtaaaata 24120 aaggcacttt gtcttctatg acatagtaat ttgctctgaa agcaactatg aagaggtttc 24180 aaacagaatt ttaacatgaa cagcattttg gcataagaat atagcctcta aaagtgagta 24240 cttttatcaa taacacttat ttgtaagtat aactcctggt gtgtttataa aaaaaatcaa 24300 ttcacctact acatagtatt tttatgactt ctagggatta gaggcatgta tttaaatggt 24360 catatactgt attacaacgt ttttctgaaa ttgcaggtgt tctattccaa catttttttc 24420 aaatacatta gactaaaata atttgcctac agaaacaggt caaaataggt tgcgattcta 24480 atataattta tttttagtat tttagttatc caattccact tctagttata ccatactgtt 24540 tcctggtata gtcttaagta tatcctataa aaacagaatg atttcacatt gcttaatata 24600 tgagggttag cactgaagat agtgaaaaat ccagtgagtc actgttaaac ccccattaga 24660 acttttctgt gttgtatcta tctgattatt gaagatacag tgtcacacaa gcatataaac 
24720 catctgttgc aatactattt taaaatcgta ggacaggttc aaagagggga aatgtccatt 24780 aaagaacgcc tcttgcgcat gtgtgggtct ccccatcaca ggtggctgat gaggtggcag 24840 gagcagcatc atacccacct atcagcctct taatttgaga gccccaaata ctcccctgca 24900 tgagctccca gaatgcagat tgggaaatct gttgccaggg gactcagcaa agccagatct 24960 tggtccaggg gtcataccat cagtttaatc taaatttgat gtgctttgag caatacggga 25020 catgtaacat gtttctgaaa tttgccagaa agcatgcatt ttgttttata aatattttga 25080 gtttaagttg atcagcagtt tgaccaaatt tttatttacc atttatatct acagaatacc 25140 tgtttacacc tcagtggagt ttaactttcg catctgagag attagctgga tttatctggt 25200 ttgtttgaaa cagcctagtt caggtatatg gaccatttga gatcacagaa tttaatgcta 25260 gctgccagta aaaagatgca gtcccaggct ccagctctta ctttgcagca gcctagtgaa 25320 acacaactaa gaatcctcaa gatcctaagg ccaagtactt gatccttgtg gaccctgcaa 25380 atcatgtaac tttcacattg gctgccggag caaaccgaga gttgatttgc aagccaccgc 25440 tcgaccagca ccagaactca aagcatgcgt tctgtgtgct tacctcatcc ctgactgcag 25500 cttattcttt tctagctaaa atttacctgg atccccattt ccttacttta aaaaagtatt 25560 cattcatgat ataatttgag actgccttgg atctattaca taatctaatg aggtatattt 25620 atgtgcttct ttatgtataa tcagtgccta ccatttgccc gtgcctgtgc ctttgtgcct 25680 tgcatggtgg ttgaaccccc ttagtttgcc tatcagtagg aagacaggag atcatgttgt 25740 ttttcacctt aatggacagc tattgccctg catgtcagct cacttttaaa acaattcaag 25800 gagtctctat ttgttaggga agctgtaata tgtttggaga taataagcta tccacttggg 25860 atggcctcgg gccttgcata cggagagggt cggcaagctc agatcttttc aaagccataa 25920 atctctcatc ctttctggga atcaagtttt taaaagattc catggagatt ttatgttaga 25980 cgaattccac agggggtgtg tcccatagtc ctgtgacaga gatactaaga acatgactca 26040 ttgcatgagg agcccctaca attcttgtta tgaaagtact ggttgaattt tctcttactg 26100 ttgacatatg tttgagatcc ctgacttcct aacactgaga tctgctcgtt aaagacagtt 26160 tttagaaata ggtcttctgt gtctttcaac ttgagagtgg caagaaacac tggcaaagat 26220 aaaaagtagt ataaatgatt ctgtgatgag tttgcccttg tccctaaaga tctagcaagg 26280 cccccatcat gagagaggaa gaacaggggc caggcacaca gagtcattgt ttcgacccac 26340 attggattag ctgattacta agagatttct gttctcttgg 
agacatcata gacatagtca 26400 aaatgtagaa caatgatatc agactgctac tcagcaaatg tgaaaggcgt cactggctgg 26460 tcactaacgt gctccaaagc ttaccagacc atatgaaaag atagcacaga cccacccacc 26520 gtctcagatg cttttcacgt tatcagtgga gacaaaaaat gatgcttaag ttacaacagg 26580 cagggttcag aatgctgaac caattgccat actatttatc agctgtgcat ttgaggtctg 26640 tgaaagggct aagaattcat taattagcca ttacctgaga ccagttatag ctctgagcag 26700 acaagatcag catgacatca ataattacaa caagaagtaa tgttgctgtc acccttacag 26760 tatagtcttt tcttgacaag cttgtttgag cattattgga atattgtgag gtagaaataa 26820 tgagtattcc cattttctgg aagcccagag aggtccatca taacatgtta gcctaatgag 26880 agagcaggga ctcaagctat gttttctgac tccaaatcca tgaccttggc attgcttccc 26940 agtagacccc cattgtgaga agacgggatg agaaaaataa gatgatgaat gtcaaagtac 27000 ttgatcattg tggcttagat tcaaaggcca gtccactcct ggctccatcc ccattgctga 27060 gacccggtta ctgttattct gaactctgtg gcccagcctt tgatcccggg ttcttctgat 27120 gagcctctga ctggaatttt gaccttgctg tttgacttct gggagagctg gatccgtttt 27180 ctgcttctac ttgtttattc ctgcatcctc ccacctaggg acattactca acccaagagc 27240 cccgacccac tctcccatcc tgtcaggctt gacctctgcc cattcttgtg catatccatc 27300 caaattctag gccccaggcc ctaaccattg gcacttaagt ccccactgag taactgagct 27360 gctcagcctc catcagtctg ccaggcacca ggataccatc atcagcacgt gtgcattctg 27420 ctatgcactc ttccttttct ttctctgatt cttgtcagtc tctaattaag ccggttgtca 27480 atttgccaag cagtcagcaa aagtgcccat aacttcacgc agtagaagat gagctcctga 27540 gttgaatgac tgcctagctg ttttcaggat tattttaata acacttttta tttggcatta 27600 tcaaggagta gggtagttcc agaagtgaat tctccagata attaaaattt gggatgtcag 27660 atcttacatt aagaatttta taggaggcct tctataatgc tttgtgtact tttctgcctt 27720 cttaagccaa agatacagat catatgatcg tattttaacc catctcagga aaccgagtgt 27780 catcactgag taatctcatt ttggacggct tggatttggg gatttgcact ccattttcac 27840 cattattaag tgtgtcaggt ttacaacagc cttaagatct ggctttctgc ttcagtggag 27900 ctcatttatt gtgtacctcc tatgtgcaaa agactgccac agtggtgtgg accctaagga 27960 acttactgtc taatcttcta aaagttagaa aattagagat acagcctctt aggacaattt 28020 tgaatgggag ctcttctttg 
ccaaataatg ttacactgac tttaggtgct tagatgtctt 28080 tcgttttccc gttttgttca ttcattgagc agttctatag accaaactgt actaggtgct 28140 gaggttataa acaatgaatg aagtattgtc cctaacgtgc agacccttcc tccctggttt 28200 tcaaccattt gccagaaggg taggttccag ttaccaaaag tctgaccgcc tgacccatcc 28260 tccaagttga aaatagctac ttaaatgtgc acacacacaa acccgcatac ctattaatgt 28320 aacacattat ttccacttaa agctaccaaa atataattcc tatgaacatt agcaataaaa 28380 agaataagaa taaaatgtaa gacaagtata aaagatctag attcttagaa tccagtgata 28440 attttgtgag ctatctcaag ctggattcct ggaaaaagat accccactct ataaaaaaaa 28500 attttaattt gtcttttgtg gtcttttaat aatctcaata atgctaccat cttttcctat 28560 taaatatgtt tttggtaaag gaagaaaaat attaatatgt ctaacagcca agattccttg 28620 ggaaaagcat acaaaattaa taaataggtc taaaataaat acatggttgc ttccttttcc 28680 gatcacaaaa taatacaaat cttgggaaca tttgcacact gttcatttta aaataaaata 28740 aactgctctg agataagtac ttcagaaatt acaaggatgt aggtgttgtt atgagaagtg 28800 tacaaaggga aaaaggataa ccaaatgtgt agcggctaaa agaatttgaa aactccaacc 28860 gtaagagact tttttccatg acttaatgag gctttcttaa gtacacattc agatgctacg 28920 gaaattccca cttccagatt agggaatttt tgtatcttca attttttttt tttcttattg 28980 tctgctttag tagagttttc cctctagctt catttgggaa aatttcatct tcactgtact 29040 ttccttccca agatgcaatt atgttgacaa tttagtgatt tatttgaaag gtgttctgac 29100 cagttatggg tcacatttga aaagcatctg tgttaacagc ttgacatatg aaaatacagc 29160 aaaacagaca agctttgatg ctgtcttggc cagccttatc tccagtttta tggccctggc 29220 taggagacct gatatattta gccattgaac tgccatgtca agtcagtgac attttggaat 29280 tttcctactc attggatatt cacgttgctt aaccacgttc agccccctcc aaccccccaa 29340 tccctgccca aaatagaaaa ataaaactac cttaagaaag agcgcctgaa gtaagtctga 29400 aaagaaagca taattgttca tttttgtctc tcataatcaa ataagcaggc ttgggaattt 29460 acaaagaaga gctatgtaaa atctgcaaag cattattaaa gcctgaaaga agaggtaata 29520 agagaaacga gagacagata ggaattgtta gcttcattta tcttaatact tccacctgag 29580 tattttttca cttgtaaaga aatgtaagaa tgtgtgtgtc tttaaaaagt actttttctg 29640 taatcccagc actttaggag gctgaggtag gaagattgct tgaggtcagg agttcgagac 29700 
cagactgggc aacatagtga gacctcatct cgaaagaaga agaagaaaaa taaataaaaa 29760 ggaaaaaaaa aagcattgtt ttctagaaat ttgcctacaa agaggttttt ccagaaagac 29820 ttttcttagt gtgtgggtgt gtgtgcatat gaatgcatag gaacagaggg tgctggcaga 29880 cagctgcttt ccacagccag gcctgcagtt taccagagga cagcacagtg aggcagcagt 29940 cctagagacc atcagaatct ggaaacagag ttaattaccc tcctgacaag acagtactac 30000 acacagaccc aacagggatg gctgtgtatg ccttccacag ttgtacattt acagagtcac 30060 ccacccatac tcaagaactg ctaattggtt gtcagctttc ccagggctgc taatcagaaa 30120 gagtgcttca cacatttctc aaaggagata attaatatta tttctgaggg agaaccagct 30180 catagtaatc tatctaagaa ccttgtatgg attaattcaa aaccagcagt tcagtggagt 30240 aatacagatt agacagcaat ttaaaagcat ccattaatat tgtcatttag gctacatata 30300 cctgaaatat aaaaattatt gtattgtttc agaagtgaat atatataatt atacatttaa 30360 aaataactga ggatcaaagt atgttcattt ttattgttct cccagagttg ttgaactata 30420 taatgtgagg gtaagactga tctacaaact ttataccagc atcttaaaag gctgaatttg 30480 agtttcagct tcttacttcc ttggccagtt tttggagaat ttctcataaa ataaactaac 30540 aaaaggcaag aaagtagaga gcttagattt gtttgcttaa cacatttata acacctgaat 30600 aaccgaatat gtatttatgg tttagtaccg tggccttgaa gttagattca tgattttcat 30660 tttaacttta tatttttaat caaattaatt gtgatcccag tagcttaatt tcgccattga 30720 gtagataaaa ttgctcctag gaaatatgtg agcatcaata taaaataccg atggtgtttt 30780 tcaataacag tgtctgaagc tctctaattt gtaaaaaata ctaacatttt cctggaaata 30840 attttactga caagacatct gttaggcttt ttaggttatg aaagaagtaa ccctaaattt 30900 ttaataaatg taaaataaag aatcaacatg tgtttgtgta tgttattgca cgttagaaga 30960 tattttcttt caaaattttg ttttctaaga cttgggataa gaattggtgg ctgagcatct 31020 gcactcctga cacaatccca agggttcttt aaaaatagct cttctggaca gtgtaaggtg 31080 cttggagggt ctctaaaaat cgttcaagaa aacaacaggt agaggagagg aaagtttccc 31140 actcagatgc tgagtagaaa acagcaaacc aatgtagaca taacaaacac ccatttttgt 31200 tctttattgc ttctaggctc agaatatttt agaattcttc tccacaagga agctgttgga 31260 gggttccctt tagaaatagt tgtagccaaa gacagtgaat tcctgctgtt tcaagccttc 31320 ttcctttgga ggacattaag gtatttcttt gattcctcag ctgattcaat 
acaagatgag 31380 atagaatttc aaggtgatcc tataaacagt acttcactga atttcccctc tccaaaaccc 31440 cacctctgtg caaggatttc ctcattctcc taaatgagga aatctaagaa catttctctc 31500 ctttgaaaga ggcctactaa cacatttcta ccaagctagt ttaacggggg acagttttct 31560 ttgctccaag cagcaagtat catgtacagg gatgcatcag tgccgccaga aatcttagca 31620 ggacctctgt tcaattgaac tttgtttacg aaacacgcca aagatagggg atcaccatca 31680 tcctagaaag gcttgtattc tccgacacac agtttcagtg ccataccccc tgtaacatag 31740 gagcacagaa atgtaggatt taaatgagaa tcttctggtc ttcagaccaa aatccgacag 31800 gcaatttccg tattcaaagc actgttgcat agactagaga aaagtctggg ataataacaa 31860 ttcctgatga gcaaatttca gaatgctcta acttttagac aagcagaatg tatatttcac 31920 atactttgct cacactccct caagcctctt tatttattta tttatttttt tttttttttt 31980 tttttggatg agaaacagtc atactgagat actatgattc tagattcttt aaaatcacct 32040 cttctcccac caggaactct cttggataaa agctatgtta agaatctgac cagctcttgg 32100 tcattattct ttaagaacta taaatataat attgatttga aatgctttaa ttgagaacag 32160 tgtcaaatat gtctcacttt ggagctaatt atttagcata agaaattgaa tctcagttgt 32220 ctcatggcac ttgtttccaa aaaattgccc tgacattagt aataggatca gttctgagat 32280 tatataataa tatgttggac tggaagcaaa ctctcaaata tgtttaacaa taatgtgaga 32340 tttttttcta gataattcat aaataggtag atagtctata tacttaccgg tcatattttt 32400 gtcgtaatgt tgttctcatg gaaatctaac tactctggct atggtctcac atgataggaa 32460 accaagcact aaaatgttgg gagaatgacg aaaggcagta cactataatg gatgacttca 32520 attgaaatgc caggagacaa ctagttgcct gcaaatgtgt acaactcaac agttactcca 32580 attacatagg gtattagcag taaggaccca agacacatgt gttaccaccc ggaaaaaaac 32640 tttaaaattt gttccagttt gtgtctgagt aggaaaatga gatgtatttg ttaaatgtaa 32700 taaaatgaga tgccaagtgt taaaatgtct tcttaagaaa acctagattt ctagctatat 32760 ttaacatatc acaatgacta ttgttgtgtt caccctaatc tattgagggg aaattgaata 32820 gttctaatgt aaattgaaag ttttcatgaa taaaagattt tgttgtgttt ttctgtgttt 32880 aatgttaaat attgagggta taaaaatttg aaatggctgt acttccctgt ttctttcatc 32940 taatataggt cataagtgaa tgggaaaata ctaatttcag aggcacaaaa tgattttttg 33000 cttcttgcat gctacagcgt tgaggcttgg 
aaggtatccc atgggacttg ttttacctga 33060 aatatttaac acacacattg taagaccagc acatactgaa cggcccttta tgcaatttta 33120 tttgttcaag tgaacaaaac ctgttatgct gaatatttta ggaaaacata ttgaacatat 33180 ttcctctcct ggagaaatgt gtgttagatg tacatctgtg actgtgtacc taactgttga 33240 agccatttgg atagattggc ttctgtgact gatacggggc tgcccaagtc agcattgtgg 33300 ttctgatcct caaataaccc agtccatttg ttcctttcac tttctagcca cagatcacaa 33360 ccttaatcgc atgttaatct aatatatttt ttaaaaaaca gcacatacat cactattgag 33420 agcaaatatc atttagaggt agaagatcat ctttgtaaaa gttggacgat ggattctcta 33480 agcaaatccc taaaattcac tgctgctaaa attccatact accctatgat atgatgatta 33540 tacatttttc aaggtcagtt taatcttcag tatatacttt tccatagtat gttttctgaa 33600 ggcaaaattt tctcataaat ttcacatatt aaatgtcagt aaagaataga agattttggt 33660 atgtgtagat atgcttaaga catttcttta cccataagct gtccaggaaa gcagtgtgcc 33720 tagagttcag aattcttccc tcaactttgg cctaatcgct ctagaacata gaagtccttt 33780 ctcactggga taacctgtgg tgggtttact taaacctgat acttgccttc tacgctctct 33840 catatttgtg gttagagtaa ttttgacaaa acattggcag tcttgggact gaagactgga 33900 atagaatcat tatttggggt agctaagaaa agtagtgata aaagaaagac taataggttt 33960 gagaactaaa gatgtttgca gtaaatttta agagaatgga ggggagcagc aatacgctaa 34020 ctcaggatgt cttaaaacaa ctaaacttta gttatttttt gatgtttaca gttatctgtc 34080 aggaagaata ccttggttat tgaaggaaat tagaaactac atcagtgaaa agtttatata 34140 aaaattttag taagatgctg ttctttgtaa taagaataga gaatgtgcta tactcacttg 34200 atacatattt ttactgggag aatgctgatc taagcatctc tttagtaagt catcgttgcc 34260 tgctacctgt gaagagttga atcactgtcc ccttttagta gatggataaa aatttagaac 34320 tcttgatctc agcagtggca gctaaaacac ttattttgat ctctcttccc tggtaacagg 34380 agatgaaact tgagaaaaca gtgtggacta atggatagag tgttggacag gaggttggga 34440 acctctgttc tgttctattc ccagcctgct actgggttac tatgtaaact tagataagcc 34500 agttaccctc tctgcacctc atattttcca tctgcgaaaa tgtaggtaga gacattctat 34560 actcagctct ttcacaaaga tattgtgagc aacagcttac tttgaagacg ttagtatatt 34620 tttgttgaaa agatgttgca taagtatgtc agtgcctctg ttccctttgg caacaccgtt 34680 cccacaacct 
ggcattatat aaactgtccc ttattgatgc attgagaaac aggtaccatt 34740 acagcagatt ttgtgttgca cagaagctga gagtgtgact ggagtcagac tgcctgggtt 34800 tgaatcttgg gtccaccaca aactaatgct gtaaccatgg gcaggtcagt gtgctagatt 34860 tcctcatctg taaattgagg ataataatag cacccacctc attggaatat ggtaaagctg 34920 gaaggcattt atgaacagca ctttaaacct tatataaata cataatatgc tttatgtaag 34980 tatatgtagt taccgttatt acccaaagca attcttcatt aatatggtaa agcgtcacct 35040 ctggtcttgc tttgtttagt gcccttattc ccagcatggc tgctcactct ttgtcctgaa 35100 tgtgtgtatc tccctctgct cccacaagtc tctgggccct gaggataagt gtgtagttat 35160 ctgtcgtatc atctaccata agtgagggag tgccacataa aagataaaag aaatccaggt 35220 gttttaagct agtctcaaaa actgatttat taatttggaa aaaaaatagt aaccgatttc 35280 agaagtatta cagaatggga ctgagataat tgatagttga cagctgtatc ctgattatta 35340 taaaaaataa accggaaaac atcaggaaga aatagaagag cagttaacaa tttaaacaga 35400 gttcatgtta acctctctcg tctgcttcca gtagtggtga ccatcagaga agaattgtgt 35460 gaaggtacca aaaactgcat gaattgctca aacgcaaaga agcagagtgt tcacattttc 35520 aaaaatctga acacaaatat atctgtttca ggggttaaag atattactgg agcataaaag 35580 tttacgattt gatgacaatt actgatttga tttttgtaac attatggtgt ggaatttcaa 35640 aaattgaata gcatctaaaa ggtaatcata tgtttcattg gttatttttt tctctattta 35700 ttattagaac caaccatcat cacagttgat ccacctgcat gtggggagtt atcaaactgc 35760 actctgacag ggaaggactg cattaatggt ttcaaacgcg atcacaatgg ttgtcggacc 35820 tgtcagtgca taaacagtga gtagacagaa gactgtatgt tttttctgag gcctaatatg 35880 atattgtcaa agcatattat gtaattgaat atctttttca cttctgttca gacattccct 35940 tgacaacctg aaaaaaaggg gaaaaaagtt atttttaaca caaacaaata catagtcatt 36000 tgggagaaag gagctgctgc tttattgtgt ttttttctct gtttccttga tggccagttg 36060 catcaggagt gcaacctgga gtggctttgg agtgcatcct tccagccacc cagaacgctc 36120 tccctttagt acacttgcca ggtcatccag tgacaaggag gtcacccacg catacagtgt 36180 cacattccgc atatgacagc cataatcctt agactgtgct tcactttgtt gaccccagct 36240 ttgcctccat gcagattcca atcatgtatc caggccctgc cctctagagt atagaagagt 36300 atattgtatt tcttccagaa aaaaataaaa taatacacac acaccccacc cccatatatg 
36360 ttaaaaccat ttcgaattcc cctaattctt ctgccggaat aaacatcctc attctccttg 36420 accattcttc aggtgacagt tcctagaact ttgttgatct ttaccatcat ttgacacaca 36480 atatgtcatt gtctttacat aaaatatggc ccacaatcaa gatactccag aatgggcagg 36540 catggtggct cacgcttata atttcagcac tctgggaggc taaggcaggc agatcactgg 36600 agctcaggag ttcaagacca gcctggacga catggcgaga tcccagcctc taccaaaaat 36660 acaaaaaaaa atagctaagc atggtggtat gcgccagtgg tcccagctac tcgggaggct 36720 gaggtgggag gatcgcttga gcctggtggg gcaaaggtca cagtgagctg agagcgcatt 36780 actgcactcc agcctggatg acagcaaaag acactgtctc agaaaaaggt actccacaat 36840 ttggtctgct gaatacagtg ctactgtgtg gcctaagttt gcattagctc cttgaattgc 36900 acttcgttgg ctgatattga ggttataagt agctaaaaac actaggaact tttttacatc 36960 acctgctgtt aggtggttct catctgttct gtaatagaac gtttgatttt ttttttaaac 37020 gatataaata cccggccaaa tataacaaga aattatccga atcattttaa accattttct 37080 aactagttgg gatatttttg aatctcaata atgtgatctt atactttatg cctcccagct 37140 tatattaaat aagcacaact tatatatcct tatctaaatc actgaattaa aataaactgc 37200 acagaaccaa ggacaaagac ctgcagaatg atgctgaaca tagctgctta cagtagtcct 37260 aagcatttat accttgagta gagttattta gctgacaaca tatatctcca caagaaatct 37320 tgaaagacac ttcgcccgta tcttgtctga agtcaaaata tattgcaaaa tacatcagag 37380 cattcctttg ctggaaaccc tatttttaaa atgagctgaa gttatagggg cctggcttat 37440 ttttccagaa tctgtactgg ctcctggtag tcagtgcttt attcccctaa atgtcacaaa 37500 tcatatattt aacaattatt tctgaagaat tttcagttgt gaatgtcaat ttatgtagtt 37560 tttagaacac tttgtccctt ttgaattgac atttggccta ttttcttaaa tctctttcca 37620 ttctcttcga tctctctcac acacagtact ggttctgtaa taagcgggct ttctgtgggc 37680 ctgtggttgt aattaccatg gctcagagac ttggacttaa cgtagacaga tgctcacttt 37740 ctgatttatc agttatctca ggctttaaca taccacttaa ccttaattat ttgtcactat 37800 tttaaaaata tttttgaaga agatacaaaa taggagttaa gccattctgc ttcctcagtt 37860 acatgtgacc atcacactat ctacagtggc gttagtgcta aaactgagtt tataaaacgc 37920 ctgttggcca ggcgtggtgg ctcacaccta taatcccagc actttgggag gctaaggtgg 37980 atggattgct tgacctcagg agttcgaaac cagcctgggc 
aacatggcaa aaccccatct 38040 ctacaaaaaa aaatacaaaa gctagccagg catgatagca agggcctgta gttccagcta 38100 ctcaagagtc taaggtgggg cgatctcttg agctcgggaa gtcgaggccg cagctagctg 38160 tgattgtgcc actgcactcc agcctgggtg acagagcaag accctttttc aaaaaactaa 38220 caaacaaaaa aaaaactctt gttgagcttg tagttactac catctttaaa tttcctataa 38280 gcctctaccc atttgggggt attctgactt cctacacttg attctgttcg gggaagggtc 38340 agtccctgtt gccttttata cctcccttta gaaagccaag ttcactagag ttgccttggc 38400 agcccaagtt ttctcaaatg gccaaccttg ttttttgaac aagataatta gtaattgtca 38460 tgaagtgttt atattttaat ccttctaatg cctcttacat taagaaagct acatctttct 38520 taacatggga tcatatatat taatacattt tttctgagtt tctaaaaagt ctaaaaatac 38580 agatttcatt gtgtccagtc acctcaccta cacttgcgcc tgggttttgc tattgtgaaa 38640 tcttggccac ccggtcatgc ttatctaaga cttcggtcac ttcacacgga atctttcatc 38700 attaagttca aagaagcagt ttccctcaac tgcctttgaa ggaagagtca aaatatgtcc 38760 cttggcaaat agtcactggc aaatagtcat ttggcaagtt gagactataa ggaggatgca 38820 tatttaccct atctttaggt tagtgttttg tttggcattt taaaaggatc atcagcatgt 38880 gctacctgag tgcacctctg tttcattcac acttaggaag aaataggtac aggaggagac 38940 cattgcattt tgtatcaata aatttattat ttgcaaattg tacccattta tacagatagc 39000 tggccttcct tccttccttt tttcacctaa tataaacaga gcacctgcca cattctaggt 39060 agtattctag gtattaaggc cacaaagatg actaaaatgc aagaccttcc ctaaggaaaa 39120 aagtcttctg tagcataaac actcatgaaa tatcactact ttaagccaaa agctataacg 39180 tacaaatgta aattatttta agaggaagct ttggtccttc agcagaaccc aggcttatgt 39240 tcacaccaga gtagagaaag aactgcccgt ctcccagggc tctacttggt gctcagcaga 39300 tactggctga gtgaataaaa aggaaacctt caggatcaga gatgcctttg gatatcattc 39360 ttgcctgaga tctccctttg aggtcttgct attctaataa tatatccatc ttattttgga 39420 ttcttcttaa agttagtctt gtcatctcgc tcactgcccc actgtatacc taaagactta 39480 cttcaacact gattggaatt tttcaggttg tcctatacag aagtctagac atacagcatt 39540 ctgctgtcac ctcccccatt cgtattagca atccaggtta tttggcaact ttggaagcca 39600 cacctttggg atgtcagtca aatcaaacca aatttccagg tcctgggttt tgtcagattc 39660 ttgcgatcgt atgcaactct 
cctctcaggc tctgcacagc agcttttgtg ttgcacagaa 39720 gctgagagtg tgactggagt cagactgcct gggtttgaat cttgggtcca ccacaaatta 39780 tgctgacttc cacatgggta ataaaggcaa gttgctttac ttctttggga ctctgtgtcc 39840 acatatgtaa agggcaaata atagtaccta actaatgggt tattttgagg actaaataaa 39900 tgaatgaatg caaagctaac tctggctggc atatggccgt tgcttactct gcactgtaac 39960 tattttatag catcttcttc tgagtcggcc caagctagcc aacatggaaa gcccccagcg 40020 tccccgcccc cctttgttcc caccccatca gcctgcctcc agttcttcct gccctctaag 40080 gacactcaca ccaacagctt tccttttact ttttcatttc attgtaaccc tcaaatatta 40140 tctttctcct ccttggtttt agagaaggga gtttcttata gccaactatg gaataatagg 40200 ttgttttttt tctattgcag ctgctttata taggtattta ttttgtccca aattcaccct 40260 tacattattt ttctgcatcc catcaaaacc tttttcacat acagtagtcc taagcattta 40320 tacctgtttt ctcttaagga ttaggcctag tttccattag cttctagatt ttttttttat 40380 acagggtctt gctttgttac ccagattgga attcagtggt gcaaacacag ctcactgcag 40440 cctcaacctc ccaggctcaa gcgatcctcc cacctgagcc tcccaagtaa ctgggactac 40500 aggcactgcc accacaccca gctcattttt tttgtatttt tttgtagaga caaggttttg 40560 ccacattgtc caggcaggtc acgaactcct ggcgtcaagc aatctgcccg actcgatctc 40620 ctaaagtgct gagattacag gtgtgaggca ccgtacctgg cccttcaaga tttcttaaaa 40680 caaggccagc accccacaag tatttcttct gtcttcgttg atttgttgtt cttcattcac 40740 tcattatttt attcatctag catttgttga gtgcctacta tatcctaaag aagattaaaa 40800 atagtcctta cccttgactc acattggtaa tttgttggta atattagtca tatgttaata 40860 tgttgtaaag agacaacata atctagcata ttttaggcag tcacattagg attgaaatca 40920 aaaccctata gaaacatgtg cttcaaaagt atgtggcaag tatatgatac agaattcttt 40980 catttgaaca taacgtaatt tgccgtttta atcggtaaca aataacattc ttgtaacact 41040 aagaaaagta gtactccagg acatacagaa agtttacatt atccctgact cagttcctca 41100 aaaacttgtt tagaataaaa aatgttttta aaagtatgat agcaacaaag tttaaaacta 41160 aaatgatttt aaaatgatat aaagcaacta gtaacctaac actttgtaga tacaagcttt 41220 taacaacatt gctctttaac cagtgtcttt tgctaataga cgtaatatac ctgcttacta 41280 cataaactct gtatgaaagg catctgcatg aattaagtag catcagtctt ttggcatctt 41340 
agtgtacaca tctttaaatc aagtggattt gatcagatta tttgtaagtt ctcttccaat 41400 tttaacattc tttaattgta gacattaata gtgctttttc tagaaactga tgcagacttg 41460 tagaattctg catcagtcag cagagaggaa agagccttca tatttttcat gaaatatgta 41520 tctgaatcaa acatcacaag aaggttgcag catcaaatat agggaggcag gctccttgtt 41580 ttatttcact gtaattctaa actaacacct atcagggtca aaattgggac caccatttgt 41640 aatgttgcat ttctttactt agtagtcttg ggaacttcat ggccatcctg aactgataat 41700 aggtttgtag gaattgccag catctaagag tttaaaatac actaagtcct cacttaacat 41760 cctctgtaga ttcttggaaa ctatgacttt acgtgaaatg acatgtaaca aaaccagttt 41820 taccagaggc taattgatat aaacaagagt taagttccta ccatatattt ctggtcacga 41880 aaacatcacc aaacttctaa ataaagacct aaaacatttc taatagtaac aattaaagta 41940 agcatgagtt atatatacat ttaagaaagg ttaacaaaaa acaagtacaa tgatgattcc 42000 ccatttttaa tgaatcagtg agtgatagtg gttgtagtgg tgctgtgttg aatgacatgt 42060 aacaaaacca gttttaccag aggctaattg atataaacaa gagttaagtt cctaccatat 42120 atttctggtc acgaaaacat caccaaactt ctaaataaag acctaaaaca tttctaatag 42180 taacaattaa agtaagcatg agttatatat acatttaaga aaggttaaca aaaaacaagt 42240 acaatgatga ttccccattt ttaatgaatc agtgagtgat agtggttgta gtggtgctgt 42300 gttgaatcaa cgaataaatg tttgcaaagc aaacattgta aggagcacct ccttctgcca 42360 tgcagttcaa acacatacca tacccagtgt ggcaggctca ctgaacgctg ttgtcctgca 42420 gtgtttattg ttgtacgttt gtgtgattat cacctgcttt aacagatttg tattttataa 42480 gaatttgtat ttattcactc atttattttc caacttgctt attccagtcc agggttgagg 42540 gtggccacag cccatcccag cagctctgca agctgcaagg taggaatcga ccttgaacag 42600 gacaccatcc cattgcaggg tgcacacaca cacacacaca cacacactct ctctctctct 42660 ctctctctct cacacacaca cacacacaca cacacacaca cacactcaga ttaggaccat 42720 gtagataccc cagttcacct aacgtgcaca tctttgggat gtgagagtac ccagagaaaa 42780 cccacacagc atgagagaac atgcagtctc cacacagaga gtggccccaa ccgagaactg 42840 gtttttattc ccctcctaaa agttatgaca caatgatgtt gaacaaaacc attgctgttt 42900 gaggacctgc tgtactacag caaagaagta tgcagtactc ctctagggga caccatccac 42960 agtgtgttct ccatggaggt tacgttgtat ttatatataa tgaatggcgc 
ccccgacagc 43020 tgtgtcacgt tttgagcata ggagcaaaca ttcctatttt tccagaatga acagagtttt 43080 actacatagt tgctatcgac cggccatgct gtacaggccc tgtactttgc cattgttcat 43140 cactttctct tcaatcgtaa cttctcccca tgtattagaa agcaatttta gatcacggta 43200 aagtcatgac gctgaaaaat taggaacata aagccacctt tttctgcgtg gttgcctcaa 43260 gtgatgtact aaagctttac aaacaaacat ataaacccct acccttctat gaatactgtt 43320 tttctactct ttatttatgt ttttatttat ttattttttt tttagacaca ggttctcacc 43380 ctgtcaccca ggctggtgtg cagtggtgca gtcacagctc tcactgcagc ctcgacctcc 43440 tgggttcaag taatcctctt gcctcagcct cccaagtagc tgggattaca ttcactcacc 43500 accacaccca gctaatgttt tattttctgg cagaggcaag gtctcactat gttgcacagg 43560 ctggtctcaa actcttggcc tcaagcaatt ctcccacctc agcttcccaa agtgctggga 43620 ttacaggttc cagccagccg cagtcagtct cctctttatt taaatgtttg ctatgatgat 43680 gcatttctct taagaaaact tcataattga catatgtgga tttaggacta ccttccagtt 43740 tttaatgttg tctatttctg aactagaaga tggtattttg agatgttttg gcaagtgcta 43800 gtatttcagg aaaatttcaa tgttatcaga gtaagttatt ttcatgctgt catttctctg 43860 ataatatagc agtcttgatg acagtggagg agtaggtaac aagaacattg attgatgcat 43920 ttagttagca aatgttgttg aatacccact ctgtaccaaa taccatggtt gtccctatac 43980 agtgtgtgcc taaagatact agggcctggg cctagagctc cctctattgg gaagctgaat 44040 gtagatgggc cagagcataa catgcaggag aatggtccca attcagggca gcactagagc 44100 atcccactcc taggaggcat agccagtctc cttctaatga atgtcctgta tttttattta 44160 aatgtaccag cctttgtttg aaattccatt ccattgcaaa gcagaggtag attacagact 44220 agtttttatc acattgttta aacttgaatg gagacacaac agttttcttt aaaggtgaag 44280 cacattcaag atcaaggtct gtggaaaaac tccttcttta ttgcaggacc agaaaaataa 44340 aggcaccaag gtgctcaggc tccaaaattg cagggaattt aatcccaagg gatgttgact 44400 gactctggga ggttttcagg gtgtttttaa ttcatctgtt tcccaacata attcatttgt 44460 agccatgacc taagcattct tgtgacagaa aaatgggttc acaattgaca tctgtgttag 44520 agaaacatga ctaataaagt cttctttttt caccattccc ttaataatgg ggaatgaaaa 44580 attatgacct ttcaagtcat ataattacaa atttctaaat gaaccattta gaatttctct 44640 tttaaggatg caaattagta atcagttgct 
ttacagtgta atacaggcac atgaggttat 44700 atttatatgt gtaataaaag gtttatgata atgacagagt agaaaattac accataaatc 44760 acattttcct tcttaaaaga tagcaaatta aatatgcaaa tattgccaac tgagattcac 44820 aaaagatgct atcttgaaat atttatgtaa aacacggacc catttcacct attcattgta 44880 gttaagtttt aattttagat tataatataa gtgtgttttg aggggacaga tgcgttctgt 44940 ttttcccagt gacttctgtg ttattacaat attcatgctt aaaggacggc cagataaaat 45000 aaaaatttgt tcatcattcc aataattaga ctctgcttcc catttcacca gtccatgtag 45060 aacaatattc aaacattgct gtactttggc catcaccata ttaacttccc ttctttaaat 45120 ttgattgtct ttcaaattct atatttaatt cacacaggtc tcaacttgct tgctttatat 45180 aattaagcat gctttttttc tttttaacct taaacatttc tatttccctt ttctctcgta 45240 agccctatat aaccttttaa aatcatttta gaagacttct attaggctaa taacttcagt 45300 aatgttgcta gtcaccattg ctactttcct ttttgctctt ctaaaatagt attgtaactg 45360 actagcagag gacagttaaa catcactcca gattctgcta acgggatact ttgaacactt 45420 caaatagtgg ttacttaaac cttaatatga gtattttttc aacacacaca ttttttagcc 45480 tgtgaactaa aaagcctatt atgtaagtat atccaaatac aggggtattt gtcagcacat 45540 agagaaaaaa catatctaaa tgtaacactg aatattttaa ttactatcat tttgtaaaag 45600 atgtactttt ttgaaagtaa tagtaagaaa tcccttatgc tgcatacaat atgcagtatt 45660 atatgatagc atctaacacc cataaaagta gttgtcatgt ggatatttaa aaacatgcgt 45720 tatgggatgg gcgcagtgac tcacacctgt aaccccagca cttgggaagc tgacactggc 45780 ggatcacttg aggtcaggag ttcaagaaga acctggcaac atttagtaga gaaggcgaaa 45840 ccgcatctct actaaaaata caaaaattag ctgggcgtgg tggtgcatgc ctgtaatcac 45900 agatgcttgg gaggctgagg tgggagaatc acttgaaccc aggagacaga agttgcagtg 45960 agccgaaatc acgccactgt actccagtct aaatgacaga gcgagagatt ctgtctccaa 46020 caaaagaaaa aaggacatgc attattatgg acatgcaata gaaataaaac atataatgaa 46080 aattggactg taaatttcta attcccacat aatgtaaatg gtcatcagtg gtttgaagtg 46140 atttataaag ttcatggaaa aatgatctca gtcattaatt gtaatagtta tcagcctgct 46200 gttcactcag agcctaccag atttgagtac actttagaaa tctatgcatt tcaaagttgg 46260 actttatttt gtttattatt accaaaacat ggagtcatgt gataaccatg aaatagcaca 46320 gtctatgctt 
tttaatgata accagtgcca tggtagagct gagaaattga gatctgtgga 46380 agagtggagg attcagggac cgtatttcag catcgaagtg ttctgctctg ttcatctttg 46440 gaagaaacat gccgtctctg tgtcttcaca gccgaggaac tatgttcaga acgtaaacaa 46500 ggctgcacct tgaactgtcc cttcggtttc cttactgatg cccaaaactg tgagatctgt 46560 gagtgccgcc caaggcccaa gaagtgcaga cccataatct gtgacaagta ttgtccactt 46620 ggattgctgt acgtatttgt taattcagaa aagctattat gaagtgcaac ttggtgaaat 46680 gtttctacac attttgtttt atatgttgtt tgtgaattaa tctatggtat attgagccca 46740 gaatgtctat acttgttttg ttaggtgatt cagaaagcag ggagaaggtt gtatctccaa 46800 aaaaaaagaa tcaagtttta cttttacttt ttatggacat gggccagaca caccctttgt 46860 gcctgttata tgagactttc ccatagcaga gtcctttaga cactgctggc aagattagga 46920 gatggaattg tgaggtccca gtgttggaag tgctacaact cttcacacct gtcccaaggt 46980 ctatttgtga tcatttattt aacacatact ttaagatcaa ttgaagaccg accccccact 47040 cacttgaaat acagtgcatc cttagctgcc atgcctaggt gtctcacctt ggaactagtc 47100 agcgtttcgt atttttttct ttatctagcg ttagattttt atgaattttc ttggagctaa 47160 gggccattac ttttattttt taatatactt tattgagtca aaccccatga gaatctagat 47220 tattccaggg ggggaaatag acacttcata tttgtttctt tcgtgtctgt ataagcacct 47280 atatttgcct ttatatcagg ttgatgcctt tagaaaaatc accttaattt atatttttcc 47340 ctttgctgtc tttctttcat ttcaatctgt ctttccaatt tgatatgttt atgtagtgct 47400 agattccttt tctttttttt tttttttttt ttcaggcagg gtagtgctct gccacccagg 47460 ccgtagtgca gtggcacaag catggctcac tgcagcctca acctcccagg ctgtcagtcc 47520 tcccacctca gcctcagcct cagcctcagc ctcccgagta gctgggacta gaggcatgcg 47580 ccaccatgcc caactaattt ttgtattttt tggagagatg gggtttcacc atgtttccca 47640 ggctgatctg gaactcctgg ctcaagcgat cagcctacct tggccttcca aagtgctggg 47700 attacaggca tgagccactg tgcctggcct agtgctagta attataaggg acttagacct 47760 atcatctctt tcccatcttc tgtcagtcat ctttttcatt cttcatttaa aaattctagg 47820 tttgggaggg gtcgtcttta taattttcat tggataatta aattggaaac attgttccgt 47880 aaaattcccc aaataaaata atgaatcatc atcattctta atcctaaaat aagaatatga 47940 tgtcaataag agtattatgt cttagcctaa ctgctcatat aaaatgaagt agttgtatca 
48000 caatagtagg tagcttcaac cacttgctac agtatcatga cagttcagaa tcaaggctgt 48060 gttcaaatcc aaccgtcatg ttggagaaaa atctcttagg agattgtaga tttcactttg 48120 tattcctgtc agcacattca gggaggaccc agttggacac cttggacaac tgaatattat 48180 actggctaga acacatgtgt taaggacaca caccaaaaga gaattcttgg ccagacacca 48240 cgtgggagtg tcaggatgaa ggcagcagtt ggtgcccctt ttatgtacct gctgactaaa 48300 gaattacttt ctgcatgcag cagatacttc cagaagggag ggagggaggg agggacggat 48360 ggatcgtagg tgggtgggtg ggtggatgga tggagaaaca aatgagtaca cagataaacc 48420 aacaaaatct ccttcacttt cctattcatg gagttgcaca ttcatctttt tattgtggtg 48480 taccactctc cctaattaaa tcaacagtca tgtcattgag attctgagag ccacatccta 48540 gaagcacgta agtgcttggg caactctctg ccttggtcat gagtaagtgg tggtaatggc 48600 cacagggagc taattaccca tcactccagg aatctttgag agcaaaagtc aagtcattct 48660 gtttaatagc aatatcactt tattgaccct tggtctgagt gcttgggcat catttgagac 48720 tttgtgattt tctgacctct gacaaaattt taattaccca ggaagaataa gcacggctgt 48780 gacatctgtc gctgtaagaa atgtccagag ctctcatgca gtaagatctg ccccttgggt 48840 ttccagcagg acagtcacgg ctgtcttatc tgcaagtgca gaggtaagtg tgtacacatg 48900 gcccttcccc ctcaaagcag ccaggggcac cgaaacaagg aacataagga caccctgtaa 48960 tgaaatggtt tcggtggttg ctgcctattc taacacaaat gtcagtctct gtgggaccgt 49020 cccacaggac tcccttttcc tttcattttt atgaagtaaa agctcagaat agaatgtaaa 49080 gtaagtatat gatggacaag cacattgact aaattaatca cttcctagct ctttttactt 49140 ctatcaggaa ctttagccta aagcacaatt atctgaacct ctatagagct tctctctgtt 49200 caggagtaag attaataaaa ctgcaacagt aaatagcccc atttcaaagt tctagagggt 49260 gggtatagct cagaagttag ctgctgtatt tccagataac agttttccat gagaatttag 49320 cctttggctg aataagcaca tgccagccca cagctgtgaa taagcacaga gcctggatat 49380 ccctgctgcc tgaggcttct ccagggagct tcctgaagcc actgagcatg caccctgacc 49440 gccgtttctg cctttacccc agctccagcc ctgctgccta tcccttttta tgttgtactt 49500 gtcagcatga tgcacagatt aaactataaa agattcctaa actttgtttt acgatttttc 49560 aaaactggac agttaccacc ttaatggtgt ttctaagggg ttggagtaaa caaaaatcta 49620 gtcactgctt taattgttct gctgaacaaa actctgttat 
ttgcaagggg cccaggatgt 49680 ccccattgga tgaagcttct agagcgattt ttctgaatac attcaacata tttgtaaaat 49740 ttgttataaa atcagtgctt gtttattgta taaaatatga gaaagtacag tggaaggaat 49800 aaaaaatcag cgatcatccc acgaattgga gagaaactat ttgccggttt ttttctctct 49860 tttctttttt caagattgag aacatactga cttaaattct tttcatgaac atgtcccatg 49920 tcacaaactg tcctgtgtta gacgtaattg ttggtggctg gggtccagtc catgtacaga 49980 ctctgtagcc tgtttctcct 50000 23 50000 DNA Homo sapiens 23 gtgcagtagc cacttcttta ccaggctgcc ttttgtctgt ccagaggcct ctgcttcagc 60 tgggccaccc atcctgtcgg gcacttgtct caccgtggat ggtcatcatc ataaaaatga 120 ggagagctgg cacgatgggt gccgggaatg ctactgtctc aatggacggg aaatgtgtgc 180 cctgatcacc tgcccggtgc ctgcctgtgg caaccccacc attcaccctg gacagtgctg 240 cccatcatgt gcaggtaaaa gctggctgcc atctgtgtgc tccataagca aatgacttct 300 gccttgcagg gccctagaca catttgtaac tctggggagg ttggagggga caacccctgg 360 aacacttttc agcactgtgt ggtttttgtc aaccaccacc acagaatcat ccaggcacta 420 tcagtgaaga accaaagacc agagaaaatt ctatcattgt agaaatggct ctcctccttt 480 ttgaatgatt gactcagtta tgtaaggtct ttggactaca tttttgtaaa ggtttgtggg 540 ctgatgttct tacattccag ctctccattt tattgatagc atattaagtc tgaatgtcct 600 cattaatatg aggggattgg tactagtttt atcccataaa accttatcac atttaaagta 660 atgtttgaga gggagaaact ttttgaagtc tgtggctatt gctctttttt aaattaatag 720 acatgtggca ctcttgccaa atggagagat ttagagtgac cacacgtcca aaacatttaa 780 aatcataatc atggtaatca ggataagtta ctagaaaacc agaacagatt tgtaattcaa 840 gggagtctga attatgattg tcaataacag taattataac cataaacttc atctagtacg 900 ttttatccaa agagctcaac acagtcttat atctattatt tccctctctg tgcaggccac 960 tttgagttag ggaaaaacac attgaggcag tttctcattt ggaaaacaga cacagaaaga 1020 ttgactgtct taccaagagt aaaatagtat gtccaccaaa aggcctggga gaaatgaagt 1080 ctctcaaatc ttgtatcaga tttctttcaa agttaagtct actgtcagat tttggaggat 1140 gattgttttt cctctggtca gtccaggtcc tgtatctccc tgtgcgtgag tccacctgtg 1200 caaaaggtgt agccgggagc ctgaggcctg gtccttgtgc aacaagacta gcctgggagg 1260 gaaactgacc ttgttcgcca tggttgggga gcaggggaga gggacacttg aagtcccttt 1320 
cagctctgag attctgcaca gctcaatagc tacagtatca gagtgacagg cagctcccat 1380 accttgtaga aaacgagatt acaatgcaga agagatgact ctttttattt tgttttgttt 1440 ccctgtagtt cagttagagg tgctccatgt cagaactctt tccttttttc cgatattttg 1500 ctagttaagt gtgaaaaact gctagagtcc cctgcataag aaggtaaatg aggactttaa 1560 tatgatagaa ctctgtgttg actctttggg gagggaaaaa tcatgtgttt gcatgaagac 1620 ccctgcttta attataggtg tgaatgcata gaggagggat ttgtacctgg gtctttgcta 1680 aacatgtcct caatgaagaa ttctaacagc tagattggta gaaatgcaaa gacattaaga 1740 atggataatt gacctgagaa ctgaaaatca gtagaaccag aaagatcttt ctctattaca 1800 gagtcatgga aagttcttac gtaagcacac gtgaagttgt tgttgatacc ttactgtggg 1860 tgtgtcaaca tctggatata cgtgtatgaa aacacatttc tgtagatgga tgtgtagagg 1920 taaatgtttt ctgcaaatct ttagcatgtt gccacataag cctccttctg attagagact 1980 ccagagctgt tgcattcacg gactcctact ttaaagaata tttttctccc ctcttcattc 2040 caagatcagc tgtttttctt cttgggatac atgagtatct cccgaactct tctaatgaat 2100 atttactaaa tgatagtgtc tctcagattt tctgtcttcc atacactcta catttgagtt 2160 tctcttttgt ctccatgtct ctggcccctc ttgtgttttg ccttgaggca tatcgtacgg 2220 caaaaaggtg agccagaaag catacctgca aacacgcacg tcagatttct tctgaccact 2280 tccgtacatg tcagcattta acaaggcagt aattgtcagt cctctctaag ccctttttga 2340 aagcaataag ggaggctttt gctatattca tttatcgata ctctttggga ctttttgttt 2400 ctctgtaaac cacagatgca catttaccat cttgttcagc acagttttca ttgtaattct 2460 cttcctctct cttgttctta tactggaaag ttggttacag ttgcttgtga atgtaagcat 2520 tgattgatga gtatcagttt tctgtgatac tgtccctggg tctaagttcc agccttttgt 2580 aattcccttg agataaagct gtattagaag ttgttaccct cttaactgga aatcagccca 2640 catcacttaa tttcattagt attgttgact ctcatgctaa ccaccacttt gaaagtcata 2700 gccccaattc actcctaaga agatttcagc tctgagcatc tctgcatata tgggcccatt 2760 ctggggcgtg acttccactg agctccaagc atgagctagc cgtgatgtct ccagactctt 2820 tctcggcggg attcaacact gtcaacacat tcctgaagtg cttgttgcat ttggattacg 2880 tgactatggt taatgtcttc atacttctca tttccccagc aaaacttcag ggcaactccc 2940 agcccctctt cagacatttt caacattaaa ttccctctct atcagtgtca tctctccaag 3000 agaccaaaac 
tacctctcct ctggggaaag aaatgatagt tttcaccagc ctgccagcat 3060 ttacctctct tcatgatgat attgccaagc acatgtttag tctcaggtga gggttttcag 3120 actgttgatt ctacaggagc attcctgctc agtttgctgg cagctggctt agattggggt 3180 gtggcctgct cgacagcagg acatatttcc aagcataaaa gtactcagga tcccagaaat 3240 aataaagcag atagggcatc gatcatttgg aagttttatt taatctggag acatatccat 3300 gatccaaaca accatagact tgggttgcag tttccctcgt ctctccctga gctctcagtt 3360 tcctcatgca taaaataagg tgactgtaag gaataaatga aataacacat gtgaagtgct 3420 tagcatggaa ataactgaac cgtaccaagt attctgtaaa tgagagatct gaatccgaga 3480 cagttcctac acattcacag aaatgccaga agaggcccct gttctcatct ctagagtggg 3540 agtcccctcc ctgtaattta atgatcttat tgaattctct tattactgtt gacagactta 3600 aaccacttga acctcacgtc ctaatattag ctcttttgcc attgtcttct gtttccatgc 3660 atggaatatt ttcttcacta ttctcttaat aagaatagag cgagatgctc ttcacagttc 3720 atctcttcta tcccttaaag caaacagcac caggctttca agagttggaa agattgcaga 3780 tgaataatgt gttctgattt tgtgtcattt cccagatgac tttgtggtgc agaagccaga 3840 gctcagtact ccctccattt gccacgcccc tggaggagaa tactttgtgg aaggagaaac 3900 gtggaacatt gactcctgta ctcagtgcac ctgccacagc ggacgggtgc tgtgtgagac 3960 agaggtgtgc ccaccgctgc tctgccagaa cccctcacgc acccaggatt cctgctgccc 4020 acagtgtaca ggtaagcgac accatgcctt gtgggtgctt ggtggggaag gatgcagctc 4080 tgggtgttgt cattctgtgg gacccacagg gcaggatcag ccccatatgg gagtgtgcca 4140 aacccagtat cccatcagga acagctggca gcctgcttcc tgcatgtatt ctctttatca 4200 accacctatt tatgcagtca gcatgcctca cacctgagat gactgtgaaa caaaaaagga 4260 gactgttgct cttgccaatt tttttcctta accctgcatc tcaaactcga ggcgtatacc 4320 taagtagaga aagtcccacc gcaccaaatt aaccaaacga ttctgttact gtgaaaagtc 4380 atctttatgt aacaaatgag aagaatagtt atttataaat aatgtaaaca aaattctttg 4440 taaataaaat tctttgaaat ataagggtgg cgaagagact ttgaaatatg cataatgcca 4500 ccccctccaa gttgggatga ttagtaagtt aatgccaaaa agcattaagt gactagcact 4560 gcacatgtat aaggagtgct attattttct actttttatt tctcagataa tctgacctta 4620 gcaggtaaat gtatggccat gaaacttgtt agaaggaaat ttgtcaagat gttaatcata 4680 agttactggt tttatataac 
tttagtagta cttttaatta atactagatg tgactttttc 4740 aaaagatatt taaatataaa attatcaagt tgacaggact agttttgttt tgttgttgtt 4800 gcttcttatt cccatatttc aacttaaatc aataaccatt tactttctaa acatattctt 4860 agagcatttc tgctcagcaa aacataaacc cttgtatttt attacttctt ttttttaaca 4920 ttggattatc tagtcagctg gccagaatct gttgttccgt cattatgtgc agcatcatca 4980 gtgtctaaat tttttataag tttatgttga tggcattttt ggaattgatc agatgaactt 5040 atagggatag ccacattact ggctgttgtc atcatgcacg gcagcccaag tattaccatt 5100 ttagtacaga actgttaatc tgagaagctt aattagcact acagcttttc tagtataaga 5160 aatagttcat atgtgttaca tttttaggat atgcatgacc tatgatttcc taatttgttg 5220 tcatcacaat tcactcagat tcaaacccta ctgatgtttt ttgttaatat atagcagtga 5280 ctttacttac atcaccttgg tagaaactac ttagagtgat cttttccttg gtgataatga 5340 atttatgtac ttttagctat ttagtttaca aatatttgta gaatgcctga attgaactat 5400 ggggattcca aggttaatga aaacagacat gattcctgtc ctggagcaat agccgaaaag 5460 ccaaacaatc caatattcca cgttaagtca ataatgataa agtaagaggt gtgccttgaa 5520 agggaggatt caagaacata ggcaatccgg gaaactaaag cagcaaatac ccctggtgct 5580 atttgtgtta ggatctgagt gatgaactta gggattaact aagggagtgg cactggggga 5640 ggaaatgcct ccaggaccac tagctgaaat tcattgatgg gacaccttac tgaactcaga 5700 ggttccaccc ataaagagaa ggagagctat gtggaggtgg gaacctctga gtatgctaac 5760 atgggtggca tgtgcagagg cccatgggtt ctagccaagc aaggcagtgg tagtcctgca 5820 ttcctttcca tctttccaaa gaacaggcaa cccctcatgt gcctggcttt gacttctcat 5880 tcattaactt aggcctggat ttttttcttt acctctaact aaaggatgtt tctggacttt 5940 ttcattcatc gttgtattga gatacaaaaa aataaaaagc cttaaaactc aaattatcag 6000 tttcttccca tcacagtaat tctcaacagt ggcattgata agtggcatat tacacatgta 6060 gaaaaatatc atctctaatc ttaggacact tgcattctaa aatggcccaa aagtcatagc 6120 tccacagtgg gataacagct tccaaagggt cgaatccctc atgatgcttc catctgtagg 6180 gtgccttgtc atttcctctc ttcagctttc cctcttctct ggcctggttt aaagcaaggt 6240 ctggtgagga aatgaggaaa gattgcacat cactgcaagt gaagctactt caccagcata 6300 caaaagggag agggtaaaat agcacttgtt tctttccata agtttaaaat tttctttcct 6360 ttatttttag agttggaaaa aagcatggca 
gagaagagcg agggaggtag aaaatagaaa 6420 aaaggaaaca ctgaaaacag tgaaaacact ggtaaagatc aaataaagga gtaaacgttt 6480 ggaaacagaa gtgtaaagaa aagggtctag agcaggaaaa atgagaaaag aggcccaaag 6540 aagggagatg ctgtgggtaa gattccaaat gacattgtcg gtcagggttg ggccaggagc 6600 ctcctgcctg gtggaaggaa acctgcagac tctacctcgg ctccagccct cacaccgtca 6660 ctgccagcgt atcctgatgt cctctgggtt atttccaggc ctcagtgaga gccctgcaga 6720 ttccaggctg ttcgtttcac tgtcattcca tagcttcagc acggctgtcc cttcttcttc 6780 ttttcagttt ttagacagag gttgggcaat gcttgcatga gctaagttaa cgtaagtcac 6840 aaaaacctca tggagatttg cagggcctag tgtgaacaag agggtgttgt ccttatttct 6900 agccaaagca aaagttagca ctggtcagtc tcttagagaa gaaattatcc ctcgacgtag 6960 ataaccacga aaaggtcatt gagaaagtca tcccatctta ctatctctaa cctatgtggg 7020 tgggatctgt ggaatcatat ctgtaatagg acctagagta ccatggagcc gtggaaccaa 7080 tggctgttct tcaacttatg cctgctgcct ttgggactag tcataggaga tacagttact 7140 cagtttaaag gagagggtat gggccaatcc cattatgctg gctttcctgt ccctcagaga 7200 aggtccacaa tgaagtaact gccttggtcc tcaaaaaggc ccttcttgtc acatcctgct 7260 tactggctgt ggggtatatt catgaggtgg atagaaggaa aaagtggcat gtagagcttc 7320 tggatcatat ggcaactaca tgtctcagtt gtttctggtt cttgactaaa aataacagag 7380 ctcatgaagt tggttctggg aagaacatac gacaactctt ttgattgtgt atgcattgag 7440 aaaatgtggc cctcttagag ctttgctagg tagggctgtc gtggctgtgt ctagactctt 7500 gagtggtgaa gttgattaga agtgctctct tatcagttgg actatgacat ttctaaactg 7560 gaaacgcacc cacttacatt ttcctttctg atatctcaat catattgctg cttgaagtta 7620 caattttcat tgattttagt ggaaaccgat gcccaaagtg ctatctggat tttctggtga 7680 cattttctgt tgcaagttgt aactagatct ggggacttcc aggtctcagt ctttagatag 7740 caaagacaca gatgatttgc acatgatttc tctgtggttg catccttatg agtgatttag 7800 atttcttatc tttcagtcat taatgtgaat tcctgaacat tggaatgttg catttaggtg 7860 gtttttggta gtctggtaca atgagaattt gtctgaatat tgattggtca aaattgcatg 7920 tggccacaag taagagaaaa tccagctaca atttgtaagt aaatagaggt ctattttaca 7980 tatatgacaa gacatttgga ggcagatgtt tatcagcatt gattcagtgg ctcagtgggg 8040 tcaaggcctt tctcttctga taggggtgtc acagccgtga 
tcaacgcact cctatcaagg 8100 caggaataat gtggaaggat gatggcagaa catatctacc cttctttgaa gaaagtgaaa 8160 gctttcccag aaatcccttc gttagccatt aggtcccatt agctacggct agatgatggg 8220 gccaccctag tttcaaggga gcctaggaaa gcagggaaca gaattgtccc atttacttac 8280 atgcattgta atccaagtcc tgggacttcc ccaaagaaaa ccaggatttc attaaggaag 8340 aaagagcgga tgttatataa acaactaaca gaataactgt aacatctctt ttttgatctt 8400 tgttattttg tttaaaaatt taagaccctt cccgaatcat actttctgat ttgttactgc 8460 gtcatgtatt taaggaggca ccgaaagcca tcatgactgg gtgtgctttg aaacacacta 8520 tgttgcatag ttgctggcca acagcatctt ctttgctgac ctatatgttc atttttccag 8580 atcaaccttt tcggccttcc ttgtcccgca ataacagcgt acctaattac tgcaaaaatg 8640 atgaagggga tatattcctg gcagctgagt cctggaagcc tgacgtttgt accagctgca 8700 tctgcattga tagcgtaatt agctgtttct ctgagtcctg cccttctgta tcctgtgaaa 8760 gacctgtctt gagaaaaggc cagtgttgtc cctactgcat aggtaagacc aaggaaaaaa 8820 aaatccctca tctctaccag ttgtcactaa atctgtattg acagccattt gctagatcaa 8880 ttaatatttt tgaaactggt tttttccctg cttgacatgt cacacagacc cagtgtagca 8940 acactgagca gaagaaagat atggaagatg tactcccaag aggaggtgct gtaagaataa 9000 ataaatgagg ccaggagtgg tgtcacgcct gcaatcccag cactttggga ggccgaggca 9060 gacggatcac ttgaggtcag gagttcaaga ccagcctggc caacatgatg aaaccccgtc 9120 tctaccaaaa atacagaaat tagctgtgtg tggtggcaca cacctgtaat cccagctact 9180 ccggaggctg aggcaggaga atcacttgaa cacgggaggc ggaggttaca gtgagccaag 9240 attgtgccac tgcactccag cctgggtgac agagcaagac tccatctcaa aaaaaaaaaa 9300 aagaagaaga agaagaagga gaaagaaatg ttctctgaga atgtcatact gttacagcac 9360 tgtcgtgttc tatcggggtc cttggccagt tcagggtttt caatttcaga actctgtttc 9420 tccagcctat tttttacagt ggtttttttt ttcttttatt agacagattc tcactctgtc 9480 acccaggctg gagtgcagtg gtgcaatctc agcttgctgc aacctccgcc tcctgggttc 9540 aagtaattct cctgcctcag cctccctagt agctggaatt acaggtgcct gccacctcgc 9600 ctagctaatt tttgtatttt agtagagatg gggacagtgg tgtttttttc cagatagaaa 9660 caccaagtct catattagag cccacttagt gtataaagct tgctgctgcc ttgagatcaa 9720 ctaaatatga aatattaatt taatctccca gttgttcctg tcttctggca 
cttaatggat 9780 tctgtttcaa ccacgtggtt gtccctgtca gtggcctagg cacacagagt gaggcctgag 9840 taactctaga gccctcacct tcttgtgacc cgtggtggtg atgcatcttt gtgagcagag 9900 gatctatgta tattagcacc taacatctgg agcaaaacta gtcctatgga cataagccag 9960 atgctgttcc gaggtccaaa aggtgagaca gctggcagtt ctatcaagga cagcccgtgt 10020 tccctgggct ttcaccctct tttaacgctt ttgtgtccag taaagctcaa agtcaaatga 10080 aagacaatga cacgtcttaa tcagatagtg cacctataat cagagtagca gatgaaaaag 10140 tttacagtct gtactaattg cacatttatt gttggggatt ttccatgttt tctaaacaca 10200 agtcattgca tatatctgga gtgatagatg gggaataaag gagatctggc tacacagttt 10260 tgtggcattt ctgatagata agctgaggaa tggtggccat aactcaagct catgaggaca 10320 tctccctgtc cttgtcttgt gacttctggc cactagacag ggccactctg agtcttttac 10380 tctgcgtgtt ggtatatgag atggaaatgc ctgttcttag gaattttaca cagattttct 10440 tactgtaatg gtcaaagagt aataaaagtt ctgttttaaa agctagatgg ccaaagagta 10500 ataaaagttc tgttttaaaa gctagattct ctcttttaat agggttacca tccatataat 10560 ttattcttca aaccacgatg cttttgagag tgaaaagggg gctgttaata attctgatgg 10620 gacaatagtc atacatagat caggagtgtc ccaagcaaac tggatatgtg gtagccctgc 10680 attttagcac cggcttctct gtactctacc cctaagcttt ctagctccta agtggagcga 10740 aaaatgtatc aaggcatata tccaatttag tgctacttac cttttcaaag agctgaccaa 10800 aattcacgca acaaaatgac actgcctttg gtgagatcta aatgaaagct ataactaagc 10860 tattctttca ggctgggtaa ggggaaagta tagccaatat ccccatagtt atatatcaaa 10920 actttttttt ttaacttttt aagttcagag gtacaagtgc agtaaagctt taatactaaa 10980 ggaaatagtt cacaagtcaa ccagggatac ttatgtcata cttgggaaac cagggcccag 11040 gaggtctgag aattgcatgt taccttcaga ggaaaaatct aacagcagat ctctcgtgtg 11100 ttgaaaattc tcagattgtt ttgtgtaaat ttattgttgt tttgctactt aattgttaaa 11160 tttgtaatta gtaaattatt cgactagtaa atttgttttc atgtgatagt agataatcat 11220 ctctcacaaa accaatttaa caaattcaaa attatgaaga gaaacctgca aatcgtagta 11280 ttctgtagtc tttcttttta ttttttattt ttttgccttc atactttcta ttccaattcc 11340 tgcatctaga ttatacattc ttcattggtg tagaaaaaaa ggaccagttt ggctcaaatt 11400 tgtgattttg attttaagtt ggactgcagc 
acttttgcta aattgtggtt catatgagct 11460 cgtaatggta tagtggacat tttgattctg atctgtctcc ctccacaact tccctccccc 11520 tcccaaactg atgcacgtat aattttatca tataaaccca tgacttagct tcaggggacc 11580 atgttgggaa gacttaacat ttgatgaact gtaaccgtgt accttattca ctcgttcatc 11640 ttaataattt atatgtgttg tattattgaa atagtcatta agccctcttc tgtctgcatc 11700 atatttactt tgcaagtagc gttgttagat aggccttcat cctttcttat ctagccctgc 11760 tatatctaaa catacgaagt caagttcatg actatgatat agggatttcc tgtctgactc 11820 tccggtttgc gtttgggctt gttctcttat caactaagaa gtcctgactt ccagtcagct 11880 tggtccctat ataagggtgc caagcctggc cagcacaagc acacccaaag gcagttgcca 11940 gctgtgcttc attggcagag cctttatcac catccagagg tcactgagca aaggaaaaac 12000 atggaaacac accatcaggc ggcatgtctt aaagcaccca attgtttttc cagtgggcac 12060 caaagaccag tgctcgcatt gtttcctggt ttaggctttc tcagttttaa ctcagattat 12120 catctattct caggatcttg tgtatctcac tgttagaatg tacagactat ctcaccccag 12180 cccacagtgc tgacctcacc ttggctgtag ggctcttcca tggatgaatg ttctccctaa 12240 ttagtcaaat acagatctct aaaaacctca aactcacaat taagggacct cttttattag 12300 tggtcaaaaa aaaagtatgc atgtgaagta tccttttttc ctactgtaag aaaaatgaaa 12360 agtggtgatt ctaccaaaaa tagttttctg attacaagta tttgtgttcc ttgtgtcaaa 12420 ttgtcaagtc actaatgagc aatgataatg cttctctttt gatgttcgag ggcctcatat 12480 tatgtgtgct aatttaatag gatatgtaat atagacataa ttaataggat tgatgattaa 12540 ttttgaaaag tgaacagctt tgtctagttt cttataatat ccatcattag ggaattcatt 12600 gttctactag aaagaaacta agtagtaatt acccattcac tttaatggaa catcttctga 12660 ttccctaatt tttttttgta ctctggtttt ctaagtaaat acattcaaac ttagatttta 12720 ttttcataag ctgactataa tgaagggtta aaggatgaag aacaagaatc aaaagttgac 12780 aagtatgacg caggtggaga taaaaggtct ggtgggaatt ctgatgggtg aaaagagatt 12840 tcatatgtgg ttccttgtaa acagcaaatg cctctgttat gacctactta gccatgcaag 12900 agtgtactcg tttctaaagc atcccatggc catttcactg atgacatgct acatgccttt 12960 ctcaaacctt gccagagcaa aaattatttc agtgctttca gtgaagtctt tatttcatga 13020 cttcaaggag gatagcagag aagtaaatta gcagccctca taccacacca ctcatatgcc 13080 tgtggggata 
ctcatatccc catatgagcc ctgaccacat caaccttgct ctaatgttcg 13140 tgttggtgat gcttctgccc atactgagtg ccagactctg gcagcttctg gaggggggac 13200 cgtgagtctt tcagtggatc ctccctgtgt ccaggggtca gtcattcctg gtcagatatt 13260 aaaaagaaaa agtactactt tgatcgtaac gtctatctta ggttaatttc caggattttc 13320 aagcccactt cattgctgca tcatacctgt tgactttgga agtcaagatg ctcaggtatg 13380 tcccccagtt aagccactct aggtgatcag gactgacgtt gaataaatag ctcatgtgtt 13440 ggctaagtct taagtgttag tatgtagaac attgttttga aaattatgac ttggcaagta 13500 tttgttagta actacgtgtt attttagaag atatttcaag gtcagatttt taaagtcatt 13560 tgttggctag aaattcttga gtcagtgtat ctttctcact tcagaaattg tggatgattg 13620 tttgtgttta ttatatataa tataaatatt tacttataca ttatataaaa taaatacaag 13680 tatatatgtg gctcataact tccatgtgaa ctaagttaaa atgttctgat tacatgcctt 13740 ctaatcagcg agtatcattg aagtgttttt aagtgtcata tatctttaag gatctgtgaa 13800 aatgagttct attataggta gatggtacac taaaaatttc ttacaggttg tattaatacc 13860 tctgtttgat tagaagttag ctatttttat atttatatat ttactaatat tcgaagtggg 13920 actttgccat ggagctggct tagcttgtgt tgtgactgta tttcttctca actggctgtg 13980 atactgtcac tgtatagtga tcttctccat agtgatcttc aactcgcaaa tgattaaata 14040 atacttaacc ttaataggga aatgtctata ccttatatca agtcaataca ttattccttt 14100 tgcttagatt tctgtttatt taaacataac tctactaaac tggtatctta gagatatata 14160 gcatctcttt ggaacatctg ttggtcattg tgtatttaat tcagcccaag aatcaattat 14220 agtaaatatt taattcaggc atctttaagc tatacaaccc atgaaactta atgttacgca 14280 tttcgaaacc ataatgatat gatggtgccc accttcattg gaacattttt caccgctgct 14340 tctccagtgt attttcttaa aattaaaagc ttctgcacat tggaaagaga acttcccttg 14400 gagccagagc attataatca cagctctgtc acattctagc tttatgtctc ggtgatatga 14460 tgtaacccta gttcagtttc ctcatctata agatggcaat aataatattg tcctgtgctt 14520 cacaaaagtg ccaagaatat taaatgacat aatgtacata tgcaaaaata ctttataaac 14580 agtaaacaca atttaagtgt atattgattc agtagaattg ctgctctcgt aacacagatt 14640 gcagctgatc tgagtaacat tatcaaatag attctagtgt ctcactgtta aatgtagcgt 14700 ctggctttct acaagagaaa tgcctggata taacatctca gtgatgagct tttcgaggcc 
14760 gagaatatct tagataatgc tttggcagtg tacctcatcc tgcctgctcc catcttaaaa 14820 agcccatgtc actttggtca ctccatcgta acccaggtag aaacagtgtg ctttctgtag 14880 ctcagctgtg ttgtgacagg gccttgccca caaaccacac tgttctttca tacctctaga 14940 cattatcaga agtgagaaaa tagggctcat ttattttggt ttatgttgag aggctagtcc 15000 catttgaaga gtctgaactt actgtgcatg atccactgtg attcctggga acagagtacg 15060 tggcatttat catcagggcg caccctcata gggagatgta tacctcctgc acattcttca 15120 gtgctgtcag ttactgtgga cctacacgga ggcccaggtt gtcacagtta gtgctaaaaa 15180 gattgaaaca aaagatgaga ggtgtgattt gaaagaaggg gaaacaggat gattttctgt 15240 gtattgttta gagagtataa aaatgaaaac attctggttg tcgtgcacca tgctaacttt 15300 ggaactggtc taaagttatc acaacaaaaa taagcagtgt ttgatttcag tttgtttggg 15360 tcttacccta tgccctagga agcaactgat cttgctgctt ccataagcag ctttctcttc 15420 tcacttcagt atcctttcca ggcccccggc tgtgtaaaca gaaattcaag ttgttctggt 15480 taacttggga aaaaccccta agctgcatag tctttaggtg tgctccgctc cagccccacg 15540 ttcttaattt aaagagctct gctgcacgga agttaggctg ctgcactgct atttccagag 15600 agatccaaca gtatctgcct ttctcttatt ttggtccagg aggccaaacc tgattttgta 15660 ccctctagtg gtcaaggata tgggggaaga ccccatcaca agagggccca cctgataaga 15720 accctctgga gactgctggc agagtcttct taacaggact tggactgtga ccagataaaa 15780 catcttctat aggtgcagaa actggccaat ctggaaggtc caatggcttc tacgttaagg 15840 ttttcttaac aagatggtat accccgactt ttacatggat tgcttctttt aagtcctaaa 15900 agaagttgta tttcttcctc cttatttggg ccaccttgat ctttagttgt ttctgcctgc 15960 cttttaaaat gacattttta tatttccaaa atgatttatg tgaataatgt ccatgtacct 16020 gtagatgcat cttggattgt aggccctact atgacatgag accacattta aagttgaaaa 16080 taaaatgttt ccttcaacaa ctatccatga agtgcttaat caacccataa caaatgctta 16140 acacaaggag taagctctat tactgagaaa ttgtgtcctc tgcatgcagt ggttctcagc 16200 caaaagtgac tgtgtttact agtttggtga gatttctttt tgtagaacta ggacttggaa 16260 gcctgagcaa tcacctattt ctttctagta tgttgaagaa atctgatagg ctcttttaac 16320 ctttataaat tacttgcact tcctacccta agaaataaat gaaataaaat aaaacaaaaa 16380 tggaattcta agaaaagcct taaactgtgg ccaaatctgc 
tttcacagca attaagattg 16440 aggcagtatg gagaggtagg agtcagccga catgacagca tgtacctcat ttttttaaaa 16500 gaaagatgga ttaagatcag attttttttt attggaagct aaaagttata aagtaaaata 16560 cccaaagatt taattaaatc agtaagcatt tgttgcattt ttaaccagaa ttctttaatc 16620 atttcctgac agaatgactg tgtattagat ctctgcctca ttgaaattgt ctaggccccc 16680 tgcatctgtt ttgagcaaga cttcagttga gttccatctt ttcattcttt tacagtctgt 16740 aaaggcctgg tgagtcagcg cattgaaaat cagtcatcat cttctgtcta tgagttctta 16800 gtggcccgta cattattttc tcagatccct gcatcattac aatttccatg tgtttaagtc 16860 tcttctttat gttttttaaa aaaacaagtg aagccagttt ctcaaacagg aggtgctcac 16920 attctattta ttctccaagt gtggagtttg tgcccagcat gtgaccgaca atacagaatg 16980 aaaagcacat gacaagccag ggatttcagt gtcttgttag aaagtgaaat gaaagtggta 17040 ttttattcct ttgtggctta gagagtgctg taatgcattg ttacaggtat aaactaacat 17100 ttttagaagt acttccccta ttcttgccca ggtcagctat gtcattggta atacttggct 17160 gccttttcta ccgctcccct ttctaaatct catgttaccc atagtataca tgggcccttt 17220 gtatgcctgc ctaaatgact tcactgaagt tttctgtaaa tccctcaaac ctcaaactta 17280 agaacccaaa ctgtaatatc atgttattag caggaccagc tgtctgcgcg tcttttccag 17340 acactgggct catccatttg gatatggact cccacaaagc tgacgagacc ctctgggtgc 17400 aggagcgtct cctgaggacc aaggccttga atgccttgtt cacgtctgca tccccaacac 17460 ttaggatgac actcgggggg ccttgatctc aatgtgtggg aagaacaaat gcccccgtta 17520 tagtactgtg ttggcttgtt ttgcagtgag acttattctg agattaagag ctgcatttag 17580 cttcttggcc accaggatcc cagagtaaat tacttttcag gactctaaat agtacgcagg 17640 agatttgaaa gaactgatgc aatttctaaa gcagcctgac ttcctcccag gaggttactc 17700 ccacaccaag cctctgactt catgaccacc agatttagaa attgaagtat ctatgtaaga 17760 agttgcctcc taggcagaaa tcaagaaatc caactataac ataggttaga gtccattttg 17820 gtttttatat ccttccacag aggaaagagg aggaagaatc tggagatgcg tttttggttt 17880 ttggtttttt gttttttttt ttttccagag gctcatgtat atcctacatc atggtcagtt 17940 tcagagcagg gctgtgccac catctcagtg actcctggaa tactaaattg gatctttgta 18000 gaggaagaaa ataacacagt tctagatttt ccctagctgt taattagttt tatggcataa 18060 ttaaaatagc tcaggagtaa 
aaacaaagtc cagccttaac agcctgttaa gtcttctttt 18120 cttatcttga aaagaggtaa gataatgaag tttaaacagt tgaagaagtt aaccggaaag 18180 gaattaacat ttcaaggcct tgccgcttct tcctcctctt gtgatatgaa ccagaattga 18240 gggaaaatag gcaggaggga acccacactg aattttccag actctactgc tgaaagacat 18300 tgtatatttt tattgtaatc atatgtgatg caagataata ttgctcatat ctgaatccca 18360 aaagaaaaga agatgtttgt ctgagcatcc catgaggtaa gcagccccat ggaaggacca 18420 gctgcatcca gcaaagggct ccaggtccct gacgtagttg acggtgatgg cagaagtaaa 18480 tctttgtatt cttgcagaga ctttgtttct gaaagaggcc aaaagtcatt tcaaactgaa 18540 tctgacggat aagatagggg ccatagctga gaaaaataag tagcgaagcc gatctaaacc 18600 aaggtgtaac tgtgaaatag taaccatctt ctttgcacgt ttcaaaattg gccctggacg 18660 ccatcgcaag aggaaaattc caaagctgtg cagcagcggc agtaccattg gaagagttta 18720 cagcctacca ggactactta gaaggggatc atgtttattt ggatatataa cccctgattg 18780 gtttgtttta aaaataaacg ttattatgtt agtgtcatac ctcatggttt ccttggaaca 18840 ttctcctcta atcctctttg agagagtggg agggagagat tgtgagggga gagagaagtg 18900 ttaaacagag atgttcctaa ggaggaccag gaatcctaaa ttacctaaat aacccacttg 18960 gttatctcca ctccttgata gcactgtgtg aggtatgatg cagtcaggta aaggacaatt 19020 tatctttgaa gacaaagata aattcgagtc cccattttca agagtcagtg agaagtaaca 19080 gcttgtttgt gtggcactga ttgatccttg tccgggcaag tggtccctcc acaggttatc 19140 cggcttggca cacaacagac agaggtgctg gcggactgtg gaaccagacc cgctgtggtt 19200 cccctcctca ccctgccact tcctagctgt gcatcttgga caactgattg aatcttgtgc 19260 ctcatttttc tgtagaattg aaacaatacc ctgacccatt gggcaatgga gatcaaatgg 19320 cattgatgca ggtaacatgc ttaacacagg gcctggtatg tagtgggcat tcagtaagtg 19380 gaagttacga ttattattag gtatcaaaga acaaaactta attgtcagaa taagcaaacc 19440 agttggaata ataaacgtag cctttagagt taaatggcac aaccctgata gccacacata 19500 aactgttttg gttatttttc gttggatatt cccagtcttt atttctacat ccctcttcag 19560 gctgagtgct taaaatacat ttatatgtag tctcaattag attatgctgt ttcccccaaa 19620 agactacagt gactattatt ttaatttata gaacctcgaa ttgcatggaa tagctaccac 19680 atctcccttt aaacctggta gcttccaaaa cttcctgatg gcgttttcgc aaatgggtcc 19740 
ctatgtgtca ttgaaggaga cagtgtatct aggaaaagaa atgaggatac tgagcaaggt 19800 gttggtggct catgcctata attccagcat tttgggagac taaggcaggt agctagcttg 19860 aggccaggag ttcaagacca gcctggccaa cacaatgaga ccccatctcc acaaaaaaaa 19920 aaaaaaaaaa ccctaaaaat tatccaggca taatggtgca tactagtagt cctaactact 19980 caggaggtta aagtaggagg atcacttgag cccagaagtt caaagagcta tgattgcacc 20040 actacactcc agcctgggca acagagtgag accctgtctc ttgagaaaaa gaaaaaagtt 20100 attggttttc ccattgcctc caaggcattc agagtggttg ttctttctca taatatcatt 20160 ttataaatat tcttttagac agtgatctca tacagttact gaaagagtat ttacataaac 20220 cctgtatgct gtagctcagc atttgatctt tgatatttct tctgtggttt tgatacatca 20280 gttgaaatta ttctacatag gtcagtatca ttcagaaaac accacacatc atagccaggt 20340 gctttaggtc cacgctcagt ctaagttctt cccatcatta ttctcgtgtg tggagagagg 20400 aagggaagga aagaaggaat gaaggagaga gggaaggaaa gagggtggga gggaaaggga 20460 aggaaggaga gagggggaag gaaggaagga aggaaaggag ggagggaggg tgaaaaggag 20520 agaaggaagg acagagggag aaggaaggaa ggtgagggag tgggaaggag agaggggaaa 20580 ggaaggaagg agggagggaa ggaggggaaa ggagggaaga atgagggaga gggcaggaag 20640 ggggaagaag gaaggaagga aggagggagt gggaaggaag gagggagaaa aggatgggaa 20700 agataaggaa ggaaggaggg tggggaagga gggaggggga aggaggaaag aaaggatgga 20760 ggggacagga aggagggatg agaaaggaag gagggagggg gaaggaaaga aggagaaaag 20820 gagggaggga ggaggaggga aggaaagagg gagggaaggg gaaggaagtt cagagtacta 20880 cttaaattag gcatcattgg atgattctgt tcagccaatt tgacacgtgc attttggata 20940 ccttagtgaa gtcgcctctt ttaaggattc catattaatc aagaacttct atcatgcttt 21000 tcggtatgga agggagggag ggggaaggaa ggaagggaaa aatacagaca ggaaggaagg 21060 ggacaggaag ggagggagag agggagggga gggacaggaa ggaaaatata ggcaggaagg 21120 aagggaagga cagggaggga gagagggata gggaagggag ggagaggaag gaagggagag 21180 agggagggac aggaaggaag ggaggaagga agggagagac aggaagaggg agggagggac 21240 aggaaggaag gaaggaaaat acagacagga agggaaagga agggaaggaa ggaaggaaat 21300 acaggaagga aaaggaagac aggaagaagg aaaaggaaga caggaagaag gaaaaggaag 21360 atggaaggaa ggaaggaaag aaaaaaaaaa gaatatagac agtatagacc 
actgcttggc 21420 cttgagtcag aagagttatg gacaagatgg aatgaggagt tttacctcct gtctggtttc 21480 tttgtacggt aacactgtta ctcaaatttt attcagtgat attgcattgt atgggccaag 21540 gttcatgtaa aatataatca gtgatactgc tctatcaaac attgcttctt ttcataaaga 21600 attctcagca aaatagactc taaatttgat ggccaaaact aatgtcctca atgcagtgtc 21660 caggctcagt ctaagttctt cccatcatta ttctcatgtg tatggagaga ggaaggaaag 21720 aaggaatgaa ggagagaggg aaggagggag ggagggaggg agggaagggg aaggaaggag 21780 agagggggaa ggaaggagga caggagggag ggagggaggg taaaaaggag ggaaggaagg 21840 atggagggag aaggaaggaa ggtgagggag ggggaaggag agaggggaaa ggaagttcag 21900 agtactactt aaattaggca tcattggatg attctgttca gccaatttga cacgtgcatt 21960 ttggatacct tagtgaagtc gcctctttta aggattccat attaatcaag aacttctatc 22020 atgagtttct tttgaaagcc tccaaatgag tttttagcct aataactagt gtgccagaaa 22080 atttctgtgg agcatctaca aacataggga agtgcatttt ttaatagata ctgaaaaatt 22140 gatttctgat tacaactaga actttccatg attacttaat gccccttgtt gaaccttaga 22200 cagcagtaat gagaaaaatg atagaagttg ttgacaacat aagagatgat gcatttaaca 22260 ctcgatctca gtttcttcct tggacttgtt tctaacaccc tcccaatgga ttgagatttg 22320 tatctgtgga gtaaaaattt tacaatatta ggccattatt agtgccattt ataaaggtgc 22380 atatgcttga gtattgtctg ccctccttat ccataaccac agattgaaaa tatttttaaa 22440 aacatacaaa tttcaaaata tagtataaca actatttaca tagcatttac atcgtgttag 22500 gtattgcaag taatctagag atggcctaaa gtatacaggg ggatgtgagt aggtatatac 22560 aaatactacg ctgttttata ttagggactt ggcacatgag cgtctttgga ttttggtatc 22620 catggggtcc tagaaccagt ctcccaaggg acaattgtaa ttggtaagaa gaaaatgtag 22680 agaccacaga tcggtgagca tttctgtctg tgaccttcct ctggaaaagc tcagtctgaa 22740 gcacaaactg ggtcttcata gagaggcact tccagctgcc tcagagcact cgctgcaaag 22800 cttcgggctg ctcagagaaa atggctcctg atgccattat gggcttctcg tcgtgccctg 22860 agatgtgctg ggttctgttt cttgtgtcta ttcaatgaat tctgaccctt caagagcttc 22920 ccagatgctt tccaggaaga accctgggcc ctggggacga ctgccttcct aaggaacttt 22980 ccttctagat gcctttgtag tttttcatgt ttaagtgaca agtagtgagt ggcgctgctt 23040 gtagaatctg aacacaaaat gtccccggat 
tagtaggtgt catccagtta tcagtctcgt 23100 gctcacaccc ccgccaccgg gtttcgattt gcccatctct gctttgctgc tggatctgct 23160 ggtccttggg gcctcactct gaaccacatt cttgtgtgct agagggaaat ttctgtgtgt 23220 gaaatgtggc cactagggag tcagtaagtt caggtacacc aggaaaaggc gttgagtccc 23280 agagaaagga accttcaagg agaagcttgg aagagggagg aaatgagtgt gacaagggac 23340 catgggaaga aggagcagaa tgtttaaact gaaatccctg tatatactgt acagacaaat 23400 agctcttttt ttctcaaaaa ccagttttct gggggaaaaa aatggattgc catgaaatga 23460 agctgttttt aaagtagaaa tgtccttgta ggtgagagaa atctacgcag catccctgga 23520 aaatgaagga agtggttatc tcggtgctga ctcccactca agggttgcag aggaaacctg 23580 ctagttcctt gtgatgttgc cattatcact ccttaccatt cccctgtatt gtttacaaaa 23640 caggaacttc ttgaatccat ctctgtaaat aaactcagct ggacttttta tcaatctctt 23700 tatctctttc tagatgacag tctaaggaca cttatctaac aggtaccatt ccctgccggt 23760 tgtggtagtg accaccctgt gctagcctca gtggccacaa gatgagagcc ctgagggtcc 23820 tcctaaggaa atgaaagact agggaaacaa aagtttcacc aagttccaga agtcctaaag 23880 catactgtga ttatacaaag ctatcccttg atctctcacg tctaataaga tcgtgtgcgt 23940 tgtcacatca gcgactccat catgtgctgt tcttttcact agaagacaca attccaaaga 24000 aggtggtgtg ccacttcagt gggaaggcct atgccgacga ggagcggtgg gaccttgaca 24060 gctgcaccca ctgctactgc ctgcagggcc agaccctctg ctcgaccgtc agctgccccc 24120 ctctgccctg tgttgagccc atcaacgtgg aaggaagttg ctgcccaatg tgtccaggta 24180 tctaagccac catccttcca tttgtcaagc tgtaatgttg atttaagatg aaatcgaagc 24240 agagctcgac ctgccccttc tttcattcgt tcacacggcc caagtagcgt gtcaggccca 24300 gttagcgcct ccacccaaag accctatggg taacttggaa aactatgacc atcccgatga 24360 ggggcagatt ttggttccat gccagaattc ctcgggttct tttcttctca tttgcattct 24420 ctaaacttta gtataaccaa cctgttaatg acacctgccc tattagacta gcacagtttt 24480 agtgcagtgt tggactagcc tagagaagat ggtccaagat gggacccaga agggaagatg 24540 cttaaaagcc attacactgt gaaaatatat tagaaccaaa gtggaagacc tgcatttaaa 24600 gaatatagtt ctggttcatt ctttttttac ttcatcagtt ccagccagta cccataccat 24660 ccacgcagcc ttctcccaag ccctcagctg aactgcccat ctctttccac acttaacgca 24720 accactgagt 
gtaccccagc tggctgttgt gaatccccat gcctacatct ttccccacgc 24780 cccactcttt gtgtctgctg cattatccct ctcatagttt caaacctaag tccttctcag 24840 ctgcccagag cctcacctag cccttctcct agtttccatg cctgcatctc ctgccaaaca 24900 attcctcttt cccctgcgtc taatcccaga aatgtaagac ctgagccctc tccctgggga 24960 ctgaccccta gacacacaga tttcccggtg gtatagacca cacagaccca agtgaaggct 25020 cactctaaga tctcattgag gtcctcagcg agtcaggagc ttttctgtct tacctcgatt 25080 ctcatgtttt ctcgcagcat tatattagag aaccttgttg tattcaatac agggagctgg 25140 gaaccatgcc aattcacaag tgcatctgta ctatgcagtt gtaagatagg gcagttgtta 25200 aacccaaaga gtttaagtga gctctgcaca tgttggcctt ttgtttaggc gttgtgcaga 25260 ctagccccaa ttgagcattt ctaaaaatgt ccatttatct cctaaaaagg ataggatgtt 25320 ctgcaaggta agggggtgga accatttttc agagccttca gttgggttct ctttatagtt 25380 ttccatatac acacttgctg tgtatatgga aaaagcaagt cttaagtgct ttaaatgaga 25440 ataagaaaaa atatatggtt ttccactgga attattttat ctagtgtata ttcacctatg 25500 tttcacaaat ttcataccag tttccccagt aatatgcttt taatgatata aatgatgaaa 25560 cttgacctta tttatttcac gaagagtcat ttaatgcctt ttatgtgcca agcactgttc 25620 taggcactgg ggacacagca gtgaaccaaa tagcagtgca cacggagctt acgttgtagt 25680 agggagaagc caggcagcta acaggtaaac agtaactagg tagccaccaa gatggggatc 25740 cggctctgga gaaaggcata gcaagacaga gaggctgggg agaagtgggg caggcaggtt 25800 gttttgtatt atgtagaatg gtcagggagc accttaccaa caagatgaga tttaatgaag 25860 agacccagaa ttgcagttct catggggaac aggcagaacc tctgggtata gccagcgcca 25920 aggccctgga gtgggaacgt gctgtgcgtg ttcaaggtag agcagtgttc aaggtgcatg 25980 ttcaggagga gctgtgtgga ggtaactcag tgagcatgga ggaggtaaga ggtggggcag 26040 agaggcagcc acggccagac acaaagggtc tcgcagatga gtgcaagatt gtattgagtg 26100 agatgaaacc ttggaaggtt tggagcaggg agcgacagga tctgacttac ctttgaatgg 26160 gatttctctg gtgctgggtt gaacatagac tgtggggcat ggaggtcaat agtgaaagca 26220 gagagaccaa cttggagact tttttaagaa tccgggcgag agatgacagt gacttggaca 26280 gggtggtagc cgtggaggtg gggagaagtg ggcaggttct gactatgcca tatgagtcgc 26340 atcagtagga tttcaatgta tgcggggagt gagtggaaga ggagccaagg atgactacga 
26400 cattttggtt tggggcattt ttagcttgaa atagtcattc agcatcaaag tggagctaat 26460 gaatagacaa atatctaatc tgggcttcag gaaagaaact gaaagtagat ctacaaattt 26520 gaaagtttag ccataggtgg tttctagtct caaggttttt aagccttgag actagatgag 26580 agcaccgaag ggtaagaata aatagaaaag aaaagagagc gcaggaccga gccctgagac 26640 cttctgactt ttggagactg agcaggtgca gagggcctag caaaagaaat agagaaggag 26700 caccaggaat aaaggaagaa gccaagaggg tcagtgacat ggaaaccagg tggagaaagg 26760 gtttctggta ggaaagagta aactgctcaa tcaagcgcca ctgttagcca cgtaagatgc 26820 gatgtgggaa cagacttgtg cacttagcac tgctgtaatg aacttcacag aagtacagtg 26880 gtggaggcaa agtctaatca gagtgatttc aagaaagaat gggagaggtg aattcaattc 26940 acctctccca ttgcgagtta tggttatgca actctcatgc agaccataag tgccaagttc 27000 atgaataaga cagactgtga agttgagggt ccttggaggg tttcacagag aaaggacatg 27060 agctagattt taaaggacag atgtgctgtc ttcattggag tgaagaggat aacccaacca 27120 agacctgaga tgggaaaagt caaggccggt gcctctaagg tcaaggccag tgcctctgga 27180 gcagtgagtt ttagtagcag taggtaagga cagaaacagt agctttggca aggatttttc 27240 aggccttaat tatcaaatga agagtgtaga ttttgttctt tatagggaat ggggaggctt 27300 atttttgagc agctgaatga catgttcaca aagatgtttt ggggtaacct ttgtgtaaca 27360 gggtggtgtg caggaaggac taaagcagag agagaccaga ggccaggaca gctgtaacag 27420 gcactgcaaa tgtcctcggt gtgataacaa ggacccaaat tcaggtagtg gtggtagaaa 27480 tagaggtcaa ttcattctcc ttggtttgtg agtttgataa gagagagtac agatattgct 27540 aacatccttg ttcatcatta gagatttctc cagtgacttc tcagttaaga tttactattg 27600 ggttgaaatt taatttctgt taactctcct ttctgcatct gctgtattcc tgatgaaaca 27660 tttttctatc tttaagcaaa tactgcaatc agttgagctt ttgccacctt ctgtttgatc 27720 cagtatgggc tgccaggaag gctgtattcc tgtcccattc ttgcatcggt tcattgtgtg 27780 atcttagtca agactcacca ccaaagtttc acgcttccat ttctcatctg gatagcacat 27840 gccacctccc agaaatgctg tgaggattat accatctcag aaatgatggt gtgggagcac 27900 ttcccggtct gtccagcaaa caagaacctt gcttcttcct ttagacatag caactaacat 27960 tctgaatatc ctctttcttt acaaaacagt ttcaccttta ccatctttgg atatgagtac 28020 agaacctatg agctgttagg tgattagcac ctgtctcttt 
acagaagaaa ctgaggctca 28080 ggaaaggtaa gagggttatc cgaagtcatg gagctggtta gcagaggatg agacccctgc 28140 tcctaaattg acctcaggtt ccatctccca aaaaaatgct gacctcagct tgtaagcacc 28200 aagcacctcg ccccgtattc cagaagagga cattttaagg aaggcaactc ttagagatga 28260 ccagatgcct ctgaaagtaa aatatatccc cagtaaatta tcttgagttc ctccaaagcg 28320 caggtcaagt atgagcacag gagcgattgt gaccgtgtgc ccgtgatctt cgtggtaggg 28380 ctggatgtag ctttgtgtaa atctccctgg cactgtccat ctgtaatgcc aacagctgta 28440 aagggctctt ctagtaaata ccagaagggt tatattttag agtcttgggt tgaaggcggt 28500 ctttggccat agaaggcggc ctgggtctag gcgtggaaac aaatagcatt cccacttctt 28560 tttgtttcaa caagcagttt tttaactgaa agaattgttc tttttggcaa aactgctcag 28620 aggcagcttc ccttttatgt tcaaattaaa gggaaatgac tctcttccct tggacttgag 28680 gactgtgaac tggagctgtg cgtgttctgt gataaagaag gaggtgtggg tggattttaa 28740 tcagagctga actgagtggt ggagtgagct aatacacagg acaggttatg agctgctgga 28800 agtaagaagt ggctgtcagg catcactcat caggagagac ccatgggggc caggattggt 28860 ctcactcaaa actgtatgcc caaaaccagt gcctgccatg taggaagtgc tcaggaaata 28920 tttgttgaac aaatggtaat acaattaatc actgaagaag acatcctcct tcaccataga 28980 cagatccttc taactttccc tgctgtttcc aagtagagcc cctgtgggaa gaggactcac 29040 tgtcatgcct cagcttggtg gagtttcacc ggaaatctac ccatatgcag ggtcaaggca 29100 aaagaattcc aaagttacgt ctctccctct cactcaggaa aaaacctgag gtggaactga 29160 atcaatccca gctctggggc ctctgcagaa acttttacta cttagccatt gacatttaca 29220 ggtttgttgg aattggtgtc gcatttgccc tttacactga ggcctcagaa accagtggca 29280 gttcggcccc tattgtcata ggaaggctca gtgtgaagcc ctggactagg caggggaggg 29340 gcagaagggt gtgtgtggca ctgaaaggtg tcaagtaaat ggcgacatgg cttttctgcc 29400 catattcacc caggaaattc ataccacaca tgagagtttt tcagtgggtg cattaaatac 29460 catccccctg cacacacctc ccggaatcac ctagtccaga cctaccgaat caggacctgt 29520 gagagtcagc acattttcta agctctttgg gtaattctga tatgattttt aaaaatttga 29580 gagtcactgg tttgtgctca ggcccctgca cctgcaaatg gtaatgtttt cagtaacttc 29640 tatttttaat tctcgagttt gcagggatgc actgaacctg tagaaactat aattgttccc 29700 agttcttttt atgtttaaag 
gttttataat ataactaaga aagcagaaga ctattcttgg 29760 aggcaacaag ataacttggg ctttggaatc acagaactgc atttgaatcc cagctctacc 29820 acttactaga tgcatgattt tgagcgattt aacttaacct ctctgggttt ctttccctgg 29880 ctgggtaccc agttaactca tgggggaagc ttagcaagta ctctcttctc tatcactttc 29940 ctcttctctc aaagtcatct taaggtttac ttctcaaaag cctgttaact ttgagacaga 30000 aaacaatcac tctgagcaga ggggattatt aatcctattt taatttatat acttatactg 30060 tatattttat gttggtctaa attaggctat ggtgagtatt taggatccta cgtgttgagt 30120 aattgcttta taaaaaaatc agtgaataat atggcaggat ctgcccgaaa ggaaatacgt 30180 gcatgacaca aacttttatt gtcacatttt gaaaacaggt tctgaagaat taattgcttt 30240 aaaatgtggg gctagttgtc gttagctgga tgaagtataa atacaaagaa atatttgtag 30300 gagctgagaa aagttctgat tttttttttt tttaaaaaga cctgatattt gagataaata 30360 atattcttct gtttgtttta tgttgtagta taatacctat ctgatcaaac tggataatgt 30420 aaatatattt actgaagatc agcttctaat ctaaatggtt ccagtggtaa cataatggac 30480 atctgagact tatactcatg aggtctacct acagtccatt cccatttagg ggaagagaaa 30540 ggagaggaaa tggccaagaa gtccataaga agtggacagt gctttaactt tttcaagttt 30600 tgctaacata ttttatgtaa gttattcaat acccccatat taaagagcag cagctattaa 30660 acaccaacat ttattagctg ctggaaagta ataaataact tcagctagtc cccttcagtt 30720 tccataaaat tataaaacca acaccatttg acgtgaaaaa gcagtttttg accctggtgg 30780 agtagagaat tctttcatct tttaaatgtt ataagacatg tttctcctca ctgcaaagct 30840 cgtgattctc tagcatgagt gatgaatgat gtaaatggca gaaactgcat ctttcatgtc 30900 ccaggaaatg tggtcttgaa ttgagtgcac catttggatt gagaatacaa aagccaacaa 30960 tacagcagct tcatcatctc ttaggaaaaa tgttcccttc ttttagaaat gtatgtccca 31020 gaaccaacca atatacccat tgagaagaca aaccatcgag gagaggttga cctggaggtt 31080 cccctgtggc ccacgcctag tgaaaatgat atcgtccatc tccctagagg taagcattga 31140 aggcagctga gatctgctag ttttctatgt ggtctactta ggtgttttct aatttttaaa 31200 atttcctatg agtaactttt aaaatgactt gctgaagtaa attcaaataa ctattcccgg 31260 gcagacctgc ttggctgaaa catatttaca ccacacattg ataaaataag tcagcaggta 31320 actggattaa atgcctcctt cctggccctc tgtgctcaga gactttgtga tgacggaaat 31380 
aaatggaaat aaatatgttc ctagtcataa gagcctggta tctggaactt ctaagccatc 31440 tctcaaacgt atgtatatat aaactttgta aggagagtaa tcatgaatta gtcactgtct 31500 cttgtagatt ccgcactgag catttgtaca ttaggttacc ctctacttct gggcagtttt 31560 cacttcatgt gagctaaact tatcacctat gttgattttt ctagtcacca ggtttactcc 31620 atgccttttt atcttgctct atcctttcag tctgtgcgta ataaagatgt tattttgtct 31680 cccaatagct gtcactattt ccttttctat aaaagtagac ctcttcttga tcagtagtat 31740 aaacttgctc ttagccgtga ctcagtactg cctcctgtgt cctccaacca tattttccaa 31800 atcattgcta tataacttat atttcttttg gaagaagcaa tattcattgc ttgagagaga 31860 ttaattgtct gttactctag atattctaaa cgttatgctt ggatcaaacc agatgaccag 31920 aaagtcaaat tcctggacct agtgtttttg gctctctgta taattgttct gagtgtgctt 31980 tagcattatg cttacattca ttcagtcagt aaatattagt gcctgctgta tgccagaccc 32040 tgttctagag gctagacttc aacatactgt aagtacgact attcttatga ctgaagacct 32100 gtgcattgca tagtatctat gttataaagc ccctaaagag aaattttctc atgcatcaat 32160 agcaagaaaa tgtatgcttc tcctaggcaa ctgatttgtt ttaacctaaa agacattgtg 32220 aatcgtatta aagactgata gttgtttccg tttttggctt tggctgctac aaagctcaaa 32280 gctaactttg atggcttggt tttagcacag aaacagaact ggaagactga attttgtaaa 32340 cagacctcac ttgcagcttc atttttcttt caccaagacc ttccttaatt tatatcttat 32400 tcttcttgta tgacacatat tcctaaaact gctttctatt tgatttggaa tatgggaaag 32460 taaaattcat tcttttattc attgtatatt taaaatctga attcatcaat aaaatgttac 32520 tctttgacat ttttcttgat gctggaaaat atgtttctat aaagttttat tgccatctgc 32580 actggctaaa atgattagga atgttttcta cctcccattc ctccctgctc ccaccacccc 32640 tccagaacac acacagaagc atatcattta aaatctgagt tttaaaccac agtgttacat 32700 accacaaagc tctgtcactt catgagctga atttccagaa caatttagga gtaactcctt 32760 ttgatactgt gtgtcatgtg tctttaacta aattgatttt taatgcattt agaaaacctg 32820 actacaatga tttatttgaa acaaaacaga atataaaaat aaggaaactt tagggagtgc 32880 cccctattgg tatgtgttta accttatttt tgtctgcaac attcattgta agagagatat 32940 tttaaaagaa cataaaggtc atcattcata agtgcaaaac taaatgtagt cacctatgcg 33000 atataattca ttaaggcaga aaaatgatct aataggccta agcaagattt 
tcaacaagtt 33060 tatgaaaagt aagacaatat tgcttagagt tcagtgaacc agagtatcaa ccaaaagacc 33120 aacttcctga ttgattagtt ttaatctaat tagtcttagc tattaacact aattaattaa 33180 tagtaaaaga acatactttg ttctagtctc ctgtttgcca ataaatcagc taatgtttct 33240 ccttcgaaat ctgagaagac catgtatgtt aacagtaatc aaaatatttt catggtctaa 33300 ttcacttttt caaaatctac ctaagcacct tcttatttcc ttgattagct agagccggca 33360 tcatcttacc caattattaa gattctcact ctgatttttt tttttttttt tttttttttt 33420 taacattcat ccctaagcgc tctgctctct actaagagct ctttggatca acattagatt 33480 ttaatcacat ttttagcctc aaaaaactgg gaagcaataa aaccaagttg ctgatctcta 33540 tttgcaatgt cattaaaata atccttaaca attctggttt aggtaataaa tgcttataat 33600 tttttttctc ctagatatgg gtcacctcca ggtagattac agagataaca ggctgcaccc 33660 aagtgaagat tcttcactgg actccattgc ctcagttgtg gttcccataa ttatatgcct 33720 ctctattata atagcattcc tattcatcaa tcagaagaaa cagtggatac cactgctttg 33780 ctggtatcga acaccaacta aggtactgtc ttgcaaaagt tagtctcttg aatgatgaat 33840 ctaggaaaac ttacactcat attgaaattg cttgaaacct tgtgtcttaa ccagtgaagt 33900 ttttactttg ctcttcatag cctcaaatcc ataccaggct atagtatgaa tcaaatacat 33960 agtacttgcc cagggctgga ctgatcacaa ggtgaattat ccattatcca ttggttaatg 34020 aagcctaacc ttacatggat ccagttttct cgatgtactc tgaagttttc aaatgaacaa 34080 atactcctgg atttattcct aaatcattgt ttatattaga cattaaaacg tgatcacctg 34140 gcatttctta gtcatcagtc aacttagaat tgggagagaa aaaaagtccc aaagggatgg 34200 agataatact caggagaggg ctggaagatc atgagggtat acaaaaggca aaaggaacag 34260 gtcagtgcac caaaggcaag caccaagtgg aaaggtcgca aaagatgaag aggtggtgga 34320 aagtaatagt agacaaggta gggagaggca gatttcacac taagggagaa aagatgtatg 34380 tttgttttca gtacaaacat aaagggcctc taaatctgcc tattttgaaa gacaagaaaa 34440 aactgtatta attaagggaa ataaatcata atcccttcca actctaattc taccttttgc 34500 cttgagttgg tttattttaa tcataaacct taagaattgg aaacagcctg aacaattatg 34560 taatgtaatt taccatctgg ttaacccttc catgtaaaaa ttctgtcctg cggttatcct 34620 gcctccgatt tttctttgaa caaatgtaca ttgggcacag cctctgtgct gggtcaggat 34680 acagttgcca gtgtccaaag ctgggaaaca 
gacccgctca taaagtgtat ggagtgacta 34740 gcaacacaca ggctggccca gcctatgata gtgttgtaca gtctgagtgg ttagaagtca 34800 tccctttgtt tttaaaatct gcctaccttc cactcattct tcagccccct tgaaacacaa 34860 ggtctgcact tcaaggattt gaaatcattt caccagtctt tttttttttt ttttcagttc 34920 cttcatctcc ctatttgaca gtttcttgac atttcacctt aaaatacagt ttgtcagtgt 34980 ccctcataaa agtatagttg gtactagaac aagatgtgat aatctgcata caggctatca 35040 atcaatgaat tgtgaaactg ttatctcacc tgagttaggt actaaatttc tgttcatttg 35100 agataatgta aagcaactaa ttttttgtgg ttttattcct cctctcatta aatagccttc 35160 ttccttaaat aatcagctag tatctgtgga ctgcaagaaa ggaaccagag tccaggtgga 35220 cagttcccag agaatgctaa gaattgcaga accagatgca agattcagtg gcttctacag 35280 catgcaaaaa cagaaccatc tacaggcaga caatttctac caaacagtgt gaagaaaggc 35340 aactaggatg aggtttcaaa agacggaaga cgactaaatc tgctctaaaa agtaaactag 35400 aatttgtgca cttgcttagt ggattgtatt ggattgtgac ttgatgtaca gcgctaagac 35460 cttactggga tgggctctgt ctacagcaat gtgcagaaca agcattccca cttttcctca 35520 agataactga ccaagtgttt tcttagaacc aaagttttta aagttgctaa gatatatttg 35580 cctgtaagat agctgtagag atatttgggg tggggacagt gagtttggat ggggaaatgg 35640 gtgggagggt ggtgttggga agaaaaattg gtcagcttgg ctcggggaga aacctggtaa 35700 cataaaagca gttcagtggc ccagaggtta tttttttcct attgctctga agactgcact 35760 ggttgctgca aagctcaggc ctgaatgagc aggaaacaaa aaaggccttg cgacccagct 35820 gccataacca ccttagaact accagacgag cacatcagaa ccctttgaca gccatcccag 35880 gtctaaagcc acaagtttct tttctataca gtcacaactg cagtaggcag tgaggaagcc 35940 agagaaatgc gatagcggca tttctctaaa gcgggttatt aaggatatat acagttacac 36000 tttttgctgc ttttattttc ttccaagcca atcaatcagc cagttcctag cagagtcagc 36060 acatgaacaa gatctaagtc atttcttgat gtgagcactg gagctttttt ttttttacaa 36120 cgtgacagga agaggaggga gagggtgacg aacaccaggc atttccaggg gctatatttc 36180 actgtttgtt gttgctttgt tctgttatat tgttggttgt tcatagtttt tgttgaagct 36240 ctagcttaag aagaaacttt ttttaaaaag actgtttggg gattcttttt ccttattata 36300 tactgattct acaaaataga aactacttca ttttaattgt atattattca agcacctttg 36360 ttgaagctca 
aaaaaaatga tgcctcttta aactttagca attataggag tatttatgta 36420 actatcttat gcttcaaaaa acaaaagtat ttgtgtgcat gtgtatataa tatatatata 36480 tacatatata tttatacaca tacaatttat gttttcctgt tgaatgtatt tttatgagat 36540 tttaaccaga acaaaggcag ataaacaggc attccatagc agtgcttttg atcacttaca 36600 aattttttga ataacacaaa atctcattct acctgcagtt taattggaaa gatgtgtgtg 36660 tgagagtatg tatgtgtgtg tgtgtgtgtg tgtgtgtgcg cgcgcacgca cgccttgagc 36720 agtcagcatt gcacctgcta tggagaaggg tattccttta ttaaaatctt cctcatttgg 36780 atttgctttc agttggtttt caatttgctc actggccaga gacattgatg gcagttctta 36840 tctgcatcac taatcagctc ctggattttt tttttttttt tttcaaacaa tggtttgaaa 36900 caactactgg aatattgtcc acaataagct ggaagtttgt tgtagtatgc ctcaaatata 36960 actgactgta tactatagtg gtaacttttc aaacagccct tagcactttt atactaatta 37020 acccatttgt gcattgagtt ttcttttaaa aatgcttgtt gtgaaagaca cagataccca 37080 gtatgcttaa cgtgaaaaga aaatgtgttc tgttttgtaa aggaactttc aagtattgtt 37140 gtaaatactt ggacagaggt tgctgaactt taaaaaaaat taatttatta ttataatgac 37200 ctaatttatt aatctgaaga ttaaccattt ttttgtctta gaatatcaaa aagaaaaaga 37260 aaaaggtgtt ctagctgttt gcatcaaagg aaaaaaagat ttattatcaa ggggcaatat 37320 ttttatcttt tccaaaataa atttgttaat gatacattac aaaaatagat tgacatcagc 37380 ctgattagta taaattttgt tggtaattaa tccattcctg gcataaaaag tctttatcaa 37440 aaaaaattgt agatgcttgc tttttgtttt ttcaatcatg gccatattat gaaaatacta 37500 acaggatata ggacaaggtg taaatttttt tattattatt ttaaagatat gatttatcct 37560 gagtgctgta tctattactc ttttactttg gttcctgttg tgctcttgta aaagaaaaat 37620 ataatttcct gaagaataaa atagatatat ggcacttgga gtgcatcata gttctacagt 37680 ttgtttttgt tttcttcaaa aaagctgtaa gagaattatc tgcaacttga ttcttggcag 37740 gaaataaaca ttttgagttg aaatcacaca gtgtaacttg caaaaagatt ccatctctga 37800 gatccgaaag acttttaagg ttatcagtcc aatgaacccc tggtgtaaca agaagtcccg 37860 tgcttttgtc attaggcggg tctttcccaa ggtatccttg caaaaagaaa agtgtctccc 37920 agagtacaga gaatagcgat gctgttttta gggagacaag aaacctacct gctgtcagaa 37980 acgtgctcat agctggtatc aaaattaact tgctacgcac aggaggactg tggtgggaaa 
38040 tcctacagtg gttcgatgct gaagaagaaa gactttaccg agcacgatca ggggaagcct 38100 aactgcaaag agactttttg agctgacagg aaaaaataaa taggtatatt ttcatacatt 38160 ttagaagcat caatttaaga atgagctata aacagaccaa tgttgaatct cctttaaaaa 38220 aatatatccc cctaaggact ttctactata tcatataccc tttaaagcta catttcttca 38280 gtacaaggcg tcatttttac aaattcaaat taatttaaat tattaatatt actgtgtaag 38340 tacatgtatc tggggaaagg gccagtaagt tccaaataag aacattaata aaacaaggaa 38400 accatctcga gtttctcata cccctagaga tgatggctga aatgccaagc aggcattaaa 38460 cttcaaaccg cgcccctgcc tttgcagtca tggtgtctca gttgctgagt ggagtgagga 38520 atatgttagc cctgtttgtt ttgctcacag gtgtttaagt ggaattgtga aagccgcctg 38580 tagcaggtcc tatatagact cttaattcat tcatgtcgtc tatgggaggg gtctgcatgg 38640 ctgtttttga aatgtctcac agccccagaa aaagagcagg acttacttct gtcattccac 38700 agcagctcag catactctaa taaacgtcct cctcttcccc ttcgtagcat tctgggacta 38760 atcagggttt catctgctca catattgtag cgatgggttc tgtgtggaat taagggaaag 38820 aatctagcac tgtaaggaaa aaaaaaatac aacacaaaac tcccaaatat aaggctctca 38880 aacatgtatt tttacttctt aaaaaattta ttaaatgcca ttgatttgac tggaaccagc 38940 aaaagtgctc atgatattga tgaatccata agtagctgta tctagaattc atacttaaga 39000 aaaacacaca ggcatgaata aaataggaca cagtaatgag aatgggcagt tctgcagtgt 39060 acactttctc aagcacctca gaggacacac acttaaagta caattcttca cagacacatg 39120 aagcagaaca tttgaaaatc aaaacttaaa tacaagattc taaaagtgaa tggcaattta 39180 atggttaaaa tttgaccagt ggtgaaacaa gcagcaagct acaaaatcca tcaccaccaa 39240 cagtttcaat gttagcacta agtattaaac caaagtaatg catattctgg ttttgcttct 39300 tcaagaacag cccgtttatt ctgtttcaat ctctctgaac aatactggta ggtaatagtt 39360 acactgacaa ttctcacaaa aaaacagtgt tggttcttgc catcactgaa tttatagtat 39420 aaatctccca aggctttaat ctagttcatc agtgcagttt agataagaaa gccataaaaa 39480 caaatgggca tactgtcagt taatgagaaa tacaactgca catgcacaat taatattact 39540 ctctcttaat ctactaagag aacagtttat tagtagattt aacaaaaacc atctctagaa 39600 ttcaaaatag tgcccatatt tccgttggtt ctataatata aagaattagt gtagtcaaga 39660 tctgttaata ctgaatcaaa cccaagttca aatgtgctga 
gtttccaata gcaagaaggg 39720 taatggtagc cagctctcag aataatgctg tcggaaatct agcttggagc ccaccgcaga 39780 taaagttgct gctctatgta ggacacagaa ctagaagaaa aagagaactt ttagctacaa 39840 atgtatattt tgtatacatt tacacaaaac atacatttta catgtacatt taaaaaccat 39900 taagtaatga gaactaaatg ttaatgcaaa gatctatgta gcaatttttt aaaggtttta 39960 agaggctata cccaacaagg atgagttctc tcaacacaca ttagaatttc ccagaaatga 40020 gataccaaaa ggccatgtct cttagggtgc cacttggaga tagaagcccc taacattcgc 40080 cctataccat gaggcttgtg acacaacaaa caggatccct agggaagtca aagaagggtg 40140 caagtttctg cacacatttt tattcctaac tagcagcacg gcagcagcca cgcgtacatt 40200 acttcagaat cagccttggg tactactgtc aaggcagcag tggcggcagg ctgcagtggg 40260 gtcattaggc aggtgcgaga aagagaagct caattagccc atacagtgcc caccctactc 40320 ccttttctca ggccaaatga gaccactttt ctaccttctt tcacttctga catttgcatt 40380 aaaactaagt ggtgtattta ggcagccaag ccacactctc atctgtgtta aaacaacaac 40440 aaaccaccca aaaaagggga aggagtattc tccccttgag caatatcccc ctcccgggca 40500 aagtgtcgta ttattggtcc ccaaggaaga ggaacagtta accagctgtg aactttcagg 40560 atgggtggaa aagagcattt tcaatgaccc gacttcccca ccttcccaca cacatttggg 40620 taaatacatc tccacattgc actcccgtct tcattttctc tgactacagt ctccactgta 40680 gtctaaatcc ttgctgcttt tccacccagt cctcacaata caaagccaag acaaaaccac 40740 aacacatgca cacagacagc tcatcagaag caggtacagt atagatagaa ctgggcttac 40800 agacaggagc cagcggcccg agcatgagga gcagaattag atgggtgctg cttccctaag 40860 gcatgcggta taggtaggtt cacaattata gtcctaggtt ttcagagtaa agaattaaaa 40920 atcatttttc cccagaggac tctgttcttt gggtcataac atccttccat gcagttaaat 40980 gctaaaatag ctagagaatg cttcagaagc tcattctatc tctggccctc atagaggcct 41040 aggaacccga tataattaaa atttaagaca tgttctttat gttcgtctca ttcctgtttc 41100 acatctagaa gaacgaacct cagaacaatc cagctttcat accaatgcta gcacaacaca 41160 aaagcaaaca aagatgtcag gtattaaaaa aaaaattttt cagaatttta tagggagaaa 41220 aagtcctcca tcagaaaaaa agcttatttt ttaaaatgat gtcaactcag tctgttctac 41280 tccagtgtca acccactagt gtctagctca tgcttcttac tctcaccgca tcaagactct 41340 tacaagtgaa atgtgtgatt 
cactttgagc accctgtgta tactactgag gttccccttt 41400 catgacacta tggtttaaga aaagaccatc agaggtttct aatgtctcag aaatcttcat 41460 cttccttgtg ctcagctaac tttaatggag aagagagaga acccagagaa gaagactgtt 41520 actagaagag gattgctctc aaaatttgat caaacactac aaagatgcat tttgtgatgc 41580 tttcaaaatc tggtccctga atacctgcta ctttaaaaaa atatttcatc gtaaagtatt 41640 aattttgcct ctaattaagt tcttccgtcc cttctttatc atccaaactt tgctcttttc 41700 tcggcaaagc aactcaacct ctcactcttc ttttgcctca tcaatattcc caccctgtgt 41760 gagcctcacc ttgcctctaa gatttgtggc tgctttttca ctttcctggg tttatgcaag 41820 tccattcttt ttatttcccc gcttccctcc ctgtgtctta catcagtccc tttcgtattc 41880 cctttggcac tgttcacaga tgtgactctt ccttccttca tccgtcctca ctaatgatct 41940 gtcctccttt ccactgtcct caagatgaca gagaccaccc ttctcaaatt tcccagtgcc 42000 aatctgccaa catctgatac ccacatgatt atggtgccat ctcactaatt ttcttctact 42060 acctctcttg atgtttatag tttaactgtg gagattattt ccaacccaga acctccaata 42120 aaacttaaat tatataaaac catgatatat aatgttggta gaagtcagtc atgtagctaa 42180 tcctcaccct gcataataag attaatgaaa gttgtgcatt gggaagttac tgtctaatgt 42240 attagcaagg cgatgtattt actgactaat ccaaacctcc cagccatcgc acctatacca 42300 ttataaaact cacctttcag aatataatca gttaacaagc tcggaacttt ttcactgtcc 42360 tccttcatgg cacgaagaac tgcaaaatag aagaggggaa aaaagacaac aattaaaaat 42420 ttagatcaaa aatattattt aatttcaatc attgtggcct taatcttcct taattatagg 42480 atataactga ctcatacaaa ttataggtga ttttcctgat aaggggatac aacaataaag 42540 gcccaataat ttattttagt ggagtaagtc cacaagaatt tattttcggt aagaatatcc 42600 cacttacctg agagttgctt ccctgattta aaatttccaa acagtggaga caaaaaaaga 42660 ggaactgggt tatttacaag gagccttcgc tctgtaataa taccggaaag tgggttacta 42720 taatgtgcac ctcaatgaac tctgtgcagc aagaaaacct cagtgctgca gatcaagcaa 42780 actagagaaa ctcttaaaga aatggacagc tataccctcc aacaagcgta tcctgtcaga 42840 ctccttagaa actttcaaag aaaagaccca aatgaagaac agacttccat gaagcagcat 42900 cgggcatctg cctggctgtt gcctcctctg cactagggag aaacctggga gaattagcct 42960 cattctaatt tgacaaattt atcttctaag taaaaactaa aatagaaaaa gcagccacct 43020 
agtataacgt ttttcttttt taagctagac cattcaggtt gagataaact cacaaatatt 43080 atactttaac agagattgcc aactcaaatt ttctccaaag catcctggta aaaatatttc 43140 actgtttttc cccaccaata cgacaaggca aaatagtgta actgtcaaat cagccttcct 43200 gggtttaaac cctggctctg ccacttccta attgtgtgac gctgagcaag ctacttaacc 43260 acttttgcac caagctcttc atctgtgaaa tgagaataat aattagattt aatctcaaag 43320 gtttgttatg aggactaagc tactaattca gatataaagg tcatatcata gtgcctggca 43380 catagtacat gctatgtatt aactgtaact aacactactc aaattattta aatttttatt 43440 aactattatt attgcaccca aaatccccta atagctttta tcaggaagac tatatatcca 43500 taaacactgt gactgtctcc acagggctct tcttttggaa agaaaaggag ctttggaccc 43560 aaatatgtcc agagattcca aactgtgaca acctataaag acatctcgat gctggaatta 43620 tgatgacttt agtttgacca atgaagtatt tgttcttgat ctgagaagct gaggcctgcc 43680 agtaattcct ggcttatgag caatatttat gttcccttct ttagagggct accatgttca 43740 gtatttctgc ttccaatcat gtttaaaatt ttttctaact tctaaataaa atcacatacc 43800 aaaaaccagc atcattaaaa gtataattgc ctgggaatta agagatccag ctctgtaact 43860 aaccatcccc ctgtgtgatc ttggacacat ataacttaag ttcttcctga aaataagagg 43920 gctagctacc cagttaatct cttgattttt tatttcagac tggctggcat cacttgattt 43980 atccatattt aaatataaaa tttttaaaga attattcacc aaaggaaaga ataaaagctc 44040 caagttagca ggcagggagc taagaaataa gacacttgct tctactctta ggttttcctt 44100 gcccattcct cttaaatttc caaaataagt atcatcatcc atactcatcc atgtgatgct 44160 ctttcatttg gttcccccct aattagcaag tatcataaat atttaccata acaaagacct 44220 aacagatcag caaattcaat ccctccattt taacggctaa ttgatagtaa cttagccaaa 44280 atcatacata tattaatcta tttactgtga ggtaaataga acaaatattt tttatcctag 44340 tttttcaatg taagcaaaag agagtaagta acttccaagg atatccctga aaataacagg 44400 gatagaacca gaattcaagg ctcatttgcg gccatcataa gagatactca atatcatgta 44460 tatttaaatc taacaatttt ttttttaagg caagcccctt gaaggtgcat tatctttgtc 44520 tgctttgttc actaaagtat cccaagtgcc cagatcatgg cctggtacag agtaggcctt 44580 cactaaatat ttgctgaatg aatgcatgtt cttaaaacat cactgtcaat ttatgttcta 44640 cctataacta ctacctcggg ttctacattc ataagaaata aaaagtgttt 
accgtaaatc 44700 tttcctattc agttactgac aatcatagat atataagatc cataaggaat tttatttatg 44760 ataaatggat atgttcttaa acatgtaaga gatgatggag atttaggtta tgctttttcc 44820 caaaatttta gctcataaat atttcacaag gtaagaaaaa cagaaaaagt acaaatcaac 44880 actagagtta ctgttgtatt taatgtcaca gccatttttg gcttcattct tataaaaata 44940 aacaattata tgcatcattt acatcctaaa ctgacaaaac actaacaaca aaatcagcca 45000 actaaagtca ggtgtttacc aatcagcaaa aggaaatcag gtaatagttt cattttgtct 45060 ttgctcactt tttgttaata tttgaagatc ttcaacagac ggtggtccgt tttttttctc 45120 ataaggaatg actgtagtca aatactgcca agtttaaaaa aaaaaagtgt cacttaaatc 45180 ggaattacct tatctgcaaa aaatttgata ctgttactag aaggtctatc tgataatata 45240 aaacacaagc tactttggta atgaaaacac aggtattgta tgggaataaa gttatcttac 45300 tatctcagat ggagctcatt tttttttgcc cacctctatt tgcttcatac aaacctaatg 45360 tcaacagaaa caaccacctt tttgcatttc agttattgaa aatttcatag gactaagtct 45420 gaggatggtt tactcaattt gttcattttt aaagtgaagt attatctcac attctacttg 45480 catatttttc aacacatttt aaaaatagca atttaaacta aaaacatagt tttaacagtt 45540 ctgcacaata aagtaatttc caacagcaaa ataattctgt ttcagaccat ttttattctg 45600 gttgaacaaa tggtccacat atataacatg acaactttga aaaacacaca ctagcttctt 45660 tcagaagcta acaaagacac acttaaatcg ttttagaaat gcgttactat tattttaaat 45720 tattgggtaa gtcattgtaa acgtttttag gcattaaagg agaaaagtta aagtagagaa 45780 tggcctaagg ggaaatgaat aatgaaaatt aatgtctgaa attgagataa acttcctagt 45840 tcactacttg taaaggctca caagtaacag cgccatgtct tttaactaat cccatgaaag 45900 cataatccat aaatgattgt ccagattttc ctgttcttac catcccacca aaaaaatagt 45960 tccgtgcatt tcacatatgc tcatatagaa agccagtggt ttttacctgg aatctcttag 46020 ctccacccat tatttaacct cgtgccctaa atctcagctc tgtggctaat cctcactgag 46080 tgatcttatc ctgcccataa ctgaaaaaca ggataaagaa agctgcttta cagtaactgg 46140 atccctaagt ccagaaaagt ggtaaaactg catagttcaa tctctgagac acgttaagtg 46200 tcccaccgtc tttttgacgt agacttcttg caggtgatgc ccaattgtat aatcgaatgt 46260 ttctttgtaa agaacaacaa caacaaaagg ccagaaatga tgtcctcctg gcttcactct 46320 gcatgtgtga ggcaacctgg agaggcacca 
ggcggctgca ccctggcccg ctctctggct 46380 acctggctgc atcttcaggc ctccctgctc cactgacagt ctggaagatt ccttgtcttc 46440 ttttgaggtt gagagaggtg agaggcaacg gaatgaaaac aaaaactcaa acatcccttc 46500 atggagctca ttgatgagct gcaaatgtca gtgtgaggct tccctcttta ataggaatga 46560 tctcccttcc ggagaaacac ttaactacct gcagtaaaaa ctccaggcat ttgatctagg 46620 cgttcttttg ctaaacttcc tacaagctac attttgtaca ccttcttaat ctgcacagaa 46680 gagctaactt aaagactttt ctttttcaag gtgaaaattt catatagtga aaggtaattt 46740 taagcatacc cattaatgtg aatgtttccc atgttcatga tattccttca atcattatac 46800 aaaaatgtga ataaaaagca aattgctaga tacttgatta tatattcata tctgtgtata 46860 attttgtata aataatgtcg catgcctcgt ttggcagaca actgagttgg ctgactatgc 46920 aatacacctt aaaaacacat gaatttcatt taaaggaggt aagatacact ctatacatat 46980 ttttgaccta gaactctagt ctcaaacata ccgacatctg gccttcttcc cccaaaaatg 47040 acatcttgcg acttagactt ttagaattag agtcacatat tttagtttag ttattttcac 47100 aagcagattt tctacacatg caatacatta tgttaaaatg ttaatatttg tttaaaacta 47160 agagtgaaaa atggagctat aaaacacgtc ttaagaatgc atttaatcta ttgtcgaatt 47220 tgaaaaatgc tttttcctca aagggaaatt ctgttcatta tcaagactat aaatataatg 47280 aacttttata ataaactttt atttctctcg ttaagcaatg gtcttagtaa attagtaaac 47340 taccagcttt caggatcttc acttacacag aaaatgaaaa agcagagaat gaggaaaata 47400 aggataaccg agcacctgtt tctctccacc tgaatttcca aaggtgtggc ggaggccatt 47460 ctgaatgaca tttgagatcc cttccatgct gaagcgctaa aggcggcaat atacggccca 47520 gaagaaaagg acagagaaag aagttcaaag tctattagta agaatgtacc atgatgactt 47580 atgtgcccaa gaggcaagca gttagaaaat gccactactc atccagataa agcacataaa 47640 cccatgctct ttttaaaatg ctgttgcttc catttctttg caagttaaat gcaaaagcaa 47700 ctgtttttat gctactatat tcatgcaggc atttttctga tgtagctaat tgttccaatg 47760 taaatgttgt aagttgtaca catatttgtt ctatacaaaa tttactgtgt aatttttaag 47820 atactttttg atattattta cctacatttt atcagaagtc tgaaaactta agatgaacag 47880 tatgcgtatt ttcagcctaa gtttgtataa ttctaccatc agtttggaga acattaacat 47940 aacatttagc aaatgaaaat gctgttactt ggagagctga ttattgcttc ccactcactc 48000 ttcgggccac 
ctgccactgc cttggtgcag aaatgcgaac tagaagatgg catacgcttc 48060 ctggagagca gagtggatcc catgtgccag ccaggccccc aaaaacttct ccaaagactt 48120 ttccactccg tttctaggaa acaattctac tttctttctc ccagcaacct ccaagacttc 48180 ctagaattct ttgtactgaa aagggagtat ttttttccta aacaacttat cttgatttgt 48240 aaccagtcta caatgtcatc aagcatagta aaaagcgtct ggtggcacct ctatggcggc 48300 tgagtcaaag gagtgagatg gattcttaca gcatgactaa ttaaggggaa aggcttcgtg 48360 aaaaggaagt gaaggcctga ctcacttgat ggttccttcc cctggaattt tatagaggaa 48420 atttaaatca gtaaacacat ttgaggagtc aacataaaga atattttacc aggccccaag 48480 gggcagaaga aaggaaacca agggaatggt catcaagaaa tacatatagt ttcatccagt 48540 ggttctcaaa gtgtagtcca gggaccccct cccacccaac tcctgccaag aggtccctga 48600 aatcctctca ggagttcagg agttcaaaaa tgttttctta atactactaa aatataagtt 48660 gtctttttta ctctcattct ctcatgactt acagtgaaat attccaaatg acagactgaa 48720 tgcagaagca catttgtgaa tctagctgtc tttaatcaag ccagatgtta atgagattga 48780 caaaaatgca cagcaatgat attttttcat gattttgtaa aatattattt ttcataaata 48840 tatgctcttt ataatgaatt tatttttggt atttttcatg aatttatatt tttaatttta 48900 tcacttttaa tttgtaacat ggctaatatt aagtttcttg gggtccttga taaattttaa 48960 gagcgtaaag aagtcttgat aacaaaaagc taagaaatgc caacaggctg aatcactgag 49020 ggagctatgt gcttctgtat cagtcatgta actctatgga gttactgaaa atctgagttc 49080 caaattttac taccttacag gagactgtac aaatatgcct ctcctcctaa accatccaac 49140 ttccctacct agcctttccc agatttgcaa gccagttttg cttcagcaag aatcaaatgt 49200 ctagtcaaga tactttgctg actactcttc acttcactga aagcataact attagctttg 49260 atggaggagg ctagttataa aacaaaaacc tacagaaaac tatttctgta aatttttcca 49320 atgctatcat aattcattca tattttttaa gcagattatc aaatgagata acatcttagg 49380 cttcagagta tataataaca attatgagaa actattctca aataaccaag tcagtttatt 49440 ttttaaatta ttttcataaa atatattttc tttcaaatgt gacaaagaga agagtacttg 49500 tttttaagta cttagtctat attacctggc tccattatct atgagcatgt ttaagctaga 49560 tacaaattca taaaatctaa agctagataa aaagtttcag aatccaccta tcaggaaact 49620 aagacacact ttctatttgg aaagcacatt ccaaaaacac tagctttgtt ttcgcaatca 
49680 gaaagccaat tcctgtcact agaaaaaaca aaagttgatg cagtttcaat ttcgtaacat 49740 gcagatgtaa tttaggtttt ctttattaaa tgtgtggttg cagacattac tgagccatct 49800 caactactga cagcctgact acttgtgaaa gatggctttt gtgccaccag aattttaggc 49860 ttaggggaaa ttagtgaaaa tgcaacccca tgggaagcca aggtaacctg gaaaaacaaa 49920 actaacagct gatgccagtg aaagaaggta aagtaaggaa tgcagcagaa cctctgtcct 49980 tgctgctgat ggctgctgag 50000 24 36303 DNA Homo sapiens 24 tcagtggctg cagctgacaa gaccctactg tccccagctc tttcactggc aatgaaccag 60 ggaagcaatg tgcagcatgt acagttcaat cgcctcttga tccattggaa aattacagaa 120 caaggaaatc agaaggttct acaggaaaaa aaagaaaaaa aaaaaaaact aaaatctacg 180 tctgagaacg tgctcctatc aaacgcttca gttgcaatca actcctacct ccccagaagt 240 caggcagcat caatattctc cctccactgc atttccaaac tctccctctc tactagctcc 300 ttcctgacag catcccgaaa ttctcaaacc ctcaacctct ttcctttccc aaggttttcc 360 tgaaacagct cttcccaaca tcatttattt gatctggaag accacggcca cttttcagat 420 ctctgagaca tccaggaatc cctcctccct gaaccgcttc cttggattca gagacaccac 480 actctcccag atttccctca ccccaaccct cctgctctct agttgcttct tctaagaggg 540 gccactttct tcactcatcc cttaaatgcc agtgttccct aggatcttgt gcctctttct 600 tgactcacct ctcagatgct gctgtgcctc tggattctgt ccctatttaa ttatcatgca 660 ccccctctgg gtattaccaa tcaatagctt catcttctgc ttccataaat tgatagctcc 720 actttactaa gatcagtaaa gtgctaagaa cagagactgg cacatggcaa agctcagtac 780 atgttagcta tctttactgt aatttccctg agcttcactc tgcatgttta atggccaatt 840 ggacatctgt acctattgta tgtttaatgg cctattgaca agcctatgtt cctgtctata 900 tttaatatca atatttctaa aactcatatc caaaactgaa ttcaatgtcc tcttccccac 960 caaaacacaa acaaaaaaat ttaaaaaccg gttcctcctt cagcatttta ttctcctcgt 1020 tcattcattt cctctttctt tcaaaaaaca tcctgagcca gataccattc caggtattag 1080 agattcagag tgaacacatc agccaaggcc cctcccaccc tcctgacctt acattccagc 1140 ataagggggc acaaaaagca caacacgtaa atgaagcgaa tgctgcagat aactaacaca 1200 tgccctatcc ctagctccag caaggcgatt aaaaactgga tgcctacttt ctactcggac 1260 gccagagaaa agctgcctga ggagatgaca tctgagctga aatatgacgt atacacccat 1320 cctagatgag ccacaaagga 
caatggacaa gggagatctt ctcagagaac aaaaccaaaa 1380 gcagccaacc gggccaactc gggagttcac gccacagcta gtgtgcagga ggtcttgcaa 1440 ttctcattca atgggatgga acacagcatc tatggaatat gtccattctc tcctcctttt 1500 tataagatgt tattgcagca accctgtctt ctctcccccg ttgtgtgcca atgtgttgca 1560 atggctggct taactttcag accataggtt gacggaaaag agagacatcc cctcatttgc 1620 tggacttgaa gctagaagca ggtactgctg acaagattta catacctccc tttggggaag 1680 gggcacatgt gctttatcta tgtttgccct cgtcaacaat caatgacctg accctaactc 1740 ctcccttctt tggtgtccga aacaatcaac ctcaccaata tctctgaaat tcaactattt 1800 atatttccat ccctactttc acttccctgg ctcaattctt acatgaaatc gctgcaacag 1860 cctcctaatt aacttatctc cagttgcctc cttccaatta gttctccttg aaaccactga 1920 agttctcttc caaaacagac aaacccaaac atgtaactac ctgacctaaa atccattttt 1980 ggttctctaa accctttaaa caaaaaccag gtgcctcaga atgtcgtacg acgccatcca 2040 ttttccagct tccactgtgc aatttccata gcctcttctt gagtttcttg ccccctcacc 2100 ttctagcttt caccataatg agcagcctgc atttccccag atgtccctaa ctgtcccctc 2160 caagtggtgg cacacaccac cccctctgct gtctgtgcct gttccctcct ccttctacct 2220 gacctccggg tgcaactcag ccttcccaga cccctaaacc tagctagggg caatagtaat 2280 atatgtccac cttgaccaca gaacagagcc aggctcagaa aacactaaaa acgtggtagt 2340 tgttttgaat atagcaatta tcacaccctg tacctaggtt tctccacgtt gacactactg 2400 acatttccag tcagatcatt cttgtcacgg gggtcagtgt agtggggtgg gatctccact 2460 agcacctccg gtctctgttc accaggatgc cagcagtacc acccccatcc ttccccaccc 2520 tgcccagttg tgacaacaac aaatgttttc agacattgac tcatgtccca tggggggcaa 2580 aattggcctg gttgagaatc accgcactaa agcacattgg atggttcatc attcaatctg 2640 caacccatgg gcagcacatt cctggaaaac tgaaattggt tctacatcct taacacccat 2700 gccagggaca caactgtgga tcaattaatg tttgcctaat agtagtataa gaaaagttca 2760 ggagatggag accatcctgg ctaacacagt gaaacaccat ctctactaaa aatacaaaaa 2820 attcgccagg cacggtggcg ggcacctgta gtcccagcta ctcgggaggc tgaggcagga 2880 gaatggcgtg aatccgggag gcggagcttg cagtgagccg agatcgcgcc actgcactcc 2940 accctggacg acagagcgag actccgtctc aaaaaaaaaa aaaagtaatc aagatatttt 3000 taatggggag gtagagtgca caagatctaa 
agagaaacaa taaagaaaga aaaatgaaaa 3060 ttctggtcag gttaattttt cataaattca actcaagttc tcaaaatcct taaggacaga 3120 aaaaaaaaaa tgtcagatac ttctttcata tgctccaaaa agcaaaaacc agaccagtat 3180 gtaggaggca tctgactgat agtcagtgac ttaacctgca aaacgctaca gcgatcatta 3240 ccaataccag tgggatactt ccagatttct gtatgaaacc aggaatcata gctcaggcaa 3300 aagtcttaga atcaagaaat ttttaaacgg tcaacttgga cagaaaagtg ctgaattaga 3360 aatcgaagat ctgggttctt tcccagcttt gccactaatt gtgtgacagg agggcagtca 3420 ctttactgcc aaggcctgtt acctcacctg taaaacagat gattaaatca tattaacttt 3480 aaacgtcttc ttatttgtaa atcctacgag tctatggatt aattcaatca aaaagatcca 3540 ttattaagca aattaaaaac acacacatac gcatagacac acacacactc tctctctctc 3600 tctataggac taaataaatg agtgctccag aatggcagaa ctgcacagcc agatgataca 3660 ttcagagtcc agtggcttca ttttcttatg aggaaatgac tacattctga agggagagac 3720 aggtcttagg catctttcca tccccgcagc ttgcatagtg cctgggccat agcagtcatt 3780 tttaaacgtc tcataagctg aggctgtgac actcctaaca ccagaaccca agagccttag 3840 agcctatgat cgtttcacca tatcatgagg tgtcttgact tcttgcagga gagaaagaaa 3900 aaaaaaaagc ccaaaataag tggagtatga acagaataaa gatgacagga cagaacatca 3960 agttattttt cagaagagtg aacaacggag ttctggaaaa gtccactgaa gaatctagtc 4020 taggaaagta agatgagatt aatgaaataa tcttgttggc gtggtttata aaacataaaa 4080 caatggagtg actaagaaat ctcaaacccc ggccaggcac ggaggctcac gcctgtaatc 4140 ccagcacttt gggaggccga ggtgggtgga tcacctgacg tcaggagttc gagaccagcc 4200 tggccaacct ggtgaaacct catctctact aaaagtacaa aaattagcca gacgtggtgg 4260 cgggtgcctg taatcccagc tactcgagag gctcaggcag gagaatcact tgaacccggg 4320 aggcggaggt tgcagtgagc caagatcgca ccattgcact ccagcctggg caatagagcg 4380 agactctgtc tcaaaaaaaa aaaaaagaaa tctcaaaccc ctagctagga taatcaaaga 4440 gtagcaagta ggagatcagt gaactagaag gataaaggaa aagagagacc agggcagtga 4500 gaccaggaag aaacaggaaa cgagctgtgt agagcatact gtgcattctg ttttatagct 4560 gccctgtaaa ttccaaaaaa taacacaccc ttttggaagg caaaatgaat tttaaaaaga 4620 taaaacctca gaggcaggtt aaatatactt tttagtaaac gtcacctcat ttagaaacac 4680 tacatttaca accttgagag cagatgcggt attattaggt 
gcttatggtg cttaactggg 4740 ccagagggaa gattttctct atgcttaaat tttataagta ggaggacaag gaaagggaga 4800 gacagaaggg gtaggggagg agggagacag agtgggaggg ggaagggagg gaagcatttc 4860 tttgataaca tttctcaagg ttaaagtacc aaagacatac caaagcgcca acagaactaa 4920 gtctcagggt tatgaatgct gaatgaaatg aaggacatta aagatttgtt actaaataaa 4980 taagtaaata attttttaaa agaaaagaaa aaacaatttg ttacagttta atttaaataa 5040 ctgcaataac gcagtatatc agaaaaccca atcctccctc catttaatct accatttcca 5100 agtttgaaat aaagaatcca agtgttcaaa ttcaaagtga agaactggtg aaaattctga 5160 atctgaagtt attttgtaat tgattcatcc atttcccact acgtctttag gaaggagtta 5220 atagtgctat aaaatgcccc ctctcaggat ggaatttttg ataggagccc atttgtgagc 5280 agggaaatga ttaagcatta cagtatttac tttattgttg ccctcactac tgacaaatgc 5340 caaagtaatg tggcaaggac ggaggaagag ggtattcaat acacagcttc aacaccagta 5400 tttacgctga gaatactcac cactgcctcg tggttgtttc cttccctggc ataagtatgt 5460 attgaaagtc tcaaaatcag tccttatctg gaaacttttc tcagacaaaa ccagtagcaa 5520 caatgtataa acaggatata gatttataaa aattcaccaa aatctgaaag acgaagaaat 5580 gggccaagat ccccaaggcc catttacaac atccttgcaa agctcagaaa acgaaaattc 5640 aagccaagga tccttccatc cacctctaaa taactccaca tcctcattcc aatgcatgct 5700 ggttctgtga gctaaggtcc ctgttaagct tttgtttact catttatgaa atggaaataa 5760 taacaataat accttcttta tagagttgtt gtgaaggtta catggaataa tccatgcaag 5820 tacctagcaa agtgctcagc aaatatcaat attcaaaaaa gtattggcta taattcctaa 5880 aaataaaaag gatagaataa tattttagac acaactccca aagagaaata accacacctt 5940 tctacttttc tccagaacag caaaaacagc atttcaagtt cttctaatat aaataaaatt 6000 gcagctgaga aaaaataaat ccccgattat agttcctagc tgataggact gaaaacaaca 6060 taaagagaca ttactatcta tgactaagca cagaatcagc aaaaaagctt aaacagaaaa 6120 caagttaaca cttaaataat tcggtctttc tgaacaataa aaagaaaaga gctagagaac 6180 actatgctac ataaaagtga acaagaatca gatacagtgc tgcagtttgt tagttcttct 6240 gtggtatctt aaatatatca gcttcttcta cacacagcat attcaccagt tcaaccagct 6300 tcacagaaag caacatccaa gacagggaaa agcaagacaa caaagggctc aggaaattta 6360 cactattccc gctatcaaac ttgtttctca gcctcacctg acaccaactg 
gtacagattt 6420 ctaactggcc aggaaatcat tttaaaatct caatatcaaa gctcaatgac agaaaaaagg 6480 tttgaaaaag cactcaagga atgcttttca tctatttctg actcttactc taatctactc 6540 ttacttcatt aatattcttt atactcagga ggttcaaatg aaagcaaaat gaaaacataa 6600 ttattgttcc tgataaatca cgacactttc aatagcttga atatgatttt caaacattat 6660 aaaacagtag gtttaatttt taataaatta aaataatcaa aatgaggaaa caagtgtaaa 6720 aaatttagag taaaaatatt caaatagtta agatattatt taagatgtaa aagtaatatc 6780 ttgttttaat acttcaaata attttaatgt gaattataaa gcagttataa aagcaaattc 6840 taccaaaaat ttttttaaaa aaacagatct aagtaaatta gaaaatatac attataccaa 6900 agcagataca cacttgaaaa tcttcaatat taaacaactg ttatatgagt ttatttttat 6960 ggcagaaaaa atataataca aagatcttca aaattagcat taaataattt aatttttatc 7020 tttgaagatc tttttaaaat tttttatgta acttaatatt aatgtggaaa ctaaactaat 7080 tattgccaaa tattttcaaa accactaaaa ctagaatata ataattgatc ttggcaattt 7140 tagaaattcc ttcctgaaac ataaatactc taagtacagt catgtgctgc ttaaaaatga 7200 ggatacattc taagaaatga gtcaggcaac tttgtcgtat agaccatgac agagtgtact 7260 tacacaaacc tacataattg tatagtctac tccatactta ggctacatgg tatagtctac 7320 tgctcctagg ctacaaacct atatagcctg ttactgtact gaataatgta gacaacagta 7380 acacaatggt aagtatttgt gtatctaaat atagataaac atagaaaaag tacagtaaaa 7440 atacagcatt ataatcctga ctccatcatc tatctatacg cagtccatga ctgaccgaaa 7500 tatcactatg tggcacaaga ctgtatacaa gaagaacaca atttcctggc acattacagc 7560 ttttatgtga acttgtttat attaacaatg acctctctaa cacataaaga agtcaagaat 7620 ggactcctga tttttactac attaactgat ggatcatgaa agttcactat taccagccta 7680 ccaccaaaaa ccattgcaga aaacaaagtg tagatgataa aaacatatcg aaaactgatg 7740 atgacatata tttctctagg taatattatt tttcataaaa caaatttgtt ccacatctaa 7800 agtaattcat aattcatagg tttgaattat taaaataata gccattatat agtcaagttt 7860 ttctccaccc tacaatgacc ttctatcatg ctaatagtaa ttcttagtaa atgctttgca 7920 ctaaagaaaa tgcagcaaga tactatcaaa tatcaccaac aaattattgt taaaatgtgt 7980 caatatttac attcaagaat gcgtatcaga gccgggcacg gttgctcacg cctgtaatcc 8040 cagcactttg ggaggccgag gtgggcggat cgcttgaggc cagtagttca agaccatcct 
8100 ggccaacttg gtgaaacccc atctctacta aaaaaaatta gccgagtgtg ctggcgggcg 8160 ccgtcatccc agctactcag gaggctgagg caggagaacc gcttgaaccc aggaggtaga 8220 ggctgcagtg ggcccagatg gcaccactgc actccagcct gggcaataga gcaagaccct 8280 gtctcaaaaa aaaagaatgt atgtcaggta gaatttagta tctcaagcac cactgcttag 8340 ataaagggga aaaaggctgg gcactgtggc tcactcctgt aatcccagca ctttgggagt 8400 ccagggcagg tggatcacct gagatcagga gttcaagaca agcctggcca acatggtgaa 8460 accccacctc tactgaaaaa aaaaaaaata caaaagttag ccgggcgggg tggaccacac 8520 ttgtaatccc agctactcgg gaggctgagg caggagaatc gctggaaccg gggatgcgga 8580 ggttgcagtg agccgagatc gcgcccctgc actccagcat gagcggctga gtgagactcc 8640 gtctcaggaa aaaaaaaaaa aaaaaaaaaa aaaaagataa aggggaaaaa aactcatgcc 8700 ttttccactt ctcctctaac cttcttttaa aaccacattt tatcactgtg tataatacaa 8760 catttttatt ccttttgttt gtctccatat ttagagaaca ctggtctacc tgctcactct 8820 ctttacctag ccaccgggat tcagacacac acagcacaga ggtctgctgt tagacaagaa 8880 gctgccacac gtagcttgga gattggctgg ccaaggtgtc actatcactc aggctgctgc 8940 tgaaggggcc ccaagaaaag gaacaattca acataatctc aagataaaac aaagtaacct 9000 tttaaataca aactttagaa acattacagt actcttttta tctatgcaaa ctcacttaac 9060 tggagttcat aactttttgt tctacttttt aattttacta agtctaaatc agagattggc 9120 aaactttttc tgtcaagagc caggcattaa ttattttagg ctttgtaagc cacacagtct 9180 ctacggcaac cactcaactc tgtccatata gccccaaagc tgctacagat gatatataaa 9240 tgaatgaaca tagctgtgtt ccagtaaaac tgtattcata aaacaggtga agagttgatt 9300 tagcccttgg acagtagtct gccttgacaa tccctggttt aaatcatctg agcaaatgtg 9360 ctgttaaaaa aattccaaat taaaaagaag gggggaaaaa gccaggtcca tgttgtttta 9420 aacagaagcc ccaaacaaag gctatcaatc atattataac atgagctgag tcacagaaac 9480 ggagcaatgt agctgtgaaa gaaaatatat acaagaaaga gtgtatatca tctaagccct 9540 agctctcccc catccttcag caattctctc cctaaccccc gaaaagcaat aaattagggt 9600 aatgtgattc tttttacagt tttttctttt ctctactacc gttagtcact gattgctaat 9660 atcccaaaag ttgtccgtat aaactttttc agggttgact gacgcagtta tgcatttttg 9720 atggtggagt gtaacacagt agaaattctc tcttcaaagt tcaatgtcat tatggcaaac 9780 
agaaaagact atcgattcct atcttctcaa caagacttag cataaacaac cttacatgtg 9840 acattcaatt caaagatttg ttactagata ctcagagggt aactattcca ttttcataca 9900 tataagttta ctaagttatg atgctaagca ggtaggccat aatgcccaga caactgcaga 9960 taacactaga tgaacagcaa ctactaaatt acaacataac ctggtgttgt tcaacctatt 10020 cttgtattca tgagaactgc tactgaaatt caatgacctg tgaatgtctg atgtaacaga 10080 agcctggctc ccccaataaa aatccccaat agaacaataa taattcaaat actacaaatc 10140 taaatcctcc gggattcaat ctcctgtaca ctgaagtttc aattcaacaa acatgcatta 10200 agtgatgtgc gaagaacaag aaattaggtg aggtgctggg aaagcagtgt gaatgagcct 10260 ttaaagagct tgcactctgt tagaagagag gtttaagcca atgattacaa cacagtgtgg 10320 tacttcaata tcacaagtat gtgtaccctc acaaataaat atgaaaacac ctcataacag 10380 gcatataaac ataataggca actcataaaa gaatacaaat ggtcaacaga tcaaacccat 10440 tgtttgcctg tgcagtgtat attcaaaagc tcttaatatc tcctgttggc gggaggtgtg 10500 ggaaaatggt cactcttacg ctctactggg ctacacttct acacaataat ttgaaaagct 10560 ggtcatttaa agccatttaa aactctatat tctttgactt ggaaattcta tcagtttcag 10620 tgagtggata aactagtttt gctaaccaaa agaaaataat tcaattttag gaaaatggta 10680 actttggttt aatagtataa atttcagact cctacaagat atccaaaggg aattgtctag 10740 taggcagttg gacatatata acaattagca tttactcatg gtgtttgcta tgtttgctaa 10800 tcttatatac attatttcat ttaatctctc caaaattcaa agccgtctgt atgtaggacg 10860 aagattcaaa cacagctctt aacccaaagc tcttagtcaa tattctattg tcttagacaa 10920 tattcactgt ccatgggttt gacacctggc agaaagatcc aatatagaaa tatatatttg 10980 caagtttatc tttcgttcac tcattcattc agcagaaatt tattaggtac ctaaggttgg 11040 atctacgaat accgagaaag taaagttaaa tttcttatca ttagggacat aaaacagaga 11100 taggcatgaa aacaaaaaat tgcagtatga tacaacagtg ataatagaaa tactaaagag 11160 gagtaacact aacccctcta aaaagagaga gtagtcaact ctgcatcgga cagagtcatc 11220 aaggaaggtt ctaagaaagg aagacaatgc ttggccatga ttttaaagga tgagtgagaa 11280 aaaaaaaaaa tcccccaagt ggacagaagc agcaggtaag gaatttgatt ccagaagaag 11340 agcattgaat ataaagaagt gtcacccaaa acagcatggt cagcacaaag aaatacatgt 11400 aggtcaccat agtgataaac agacagtgcc ctttaaccac tctctaaccc 
cccatcctct 11460 gctcctccag atgctgcaaa aaaggacaga ggaaatataa agcctgccac tctcttgagt 11520 tgcacttccc aaagaggtcc actcctttaa cctcctcttc aaacggagaa ttggaaaggg 11580 actaggcagg ctcccttgtc aatgaggaga agcagtggtg tagtgacact ctccaaaaca 11640 gcttcccaac agcatcacca tctaccacct cacccagcct atggttttct tcatggcctt 11700 tatttactac taaaattcca tcatctattc attttttcct tgtgtattgc ctttctccgt 11760 cattagacta taagctccag gaaggcatgg gatgggcctc atctattcta ttctattcac 11820 cactgtggtt tttctgtttt ttgtttgaga caaggtcggg ctgtaagtgg agtggtgtga 11880 ccatggccca ctgcagcctc aacctcctgg gctcaagcaa tcctctcgcc tcagactccc 11940 aagtagttgg gactacaggt gcatgccacc acacctggct aattttataa tttttatttt 12000 gtaaagacaa ggtctcacta tattgcccag tctggcctca aactcctggg ctcaaggaat 12060 cctcccacct cagcctctca aagtcctaga attaccatca tgagtcatcc tgcccactgt 12120 atcttgaaga acacctgcca caggaaagca ctccacaaat ccttgctgaa tgaatgaata 12180 ttgtttcatc ttacccaaca tacagcatta tggttatttt aaacctaaaa ctccaatcaa 12240 aatcatattt cacttttata agataatact ttctgtaaca tttcacatat tttcatggtc 12300 taatgaaatt ttccaatgct ctaaatagca tccctatgcc ctctattttt tttttcacca 12360 caacaaatat ttaaaacaaa atctgtctcc aagtttaact ctgctctata cccactccaa 12420 tatttcatta gagtatcttt ggaccaaaga gaacaaggta acaattttag attaagtggt 12480 atgccaacta caaattgatg tactgcttca gtgaaacata catgaagaaa aatggaagct 12540 attctatact taacgaaaaa cttcaagtcc tgctaataaa acaaaaatat ctcaggcagg 12600 agaaaaaaaa aagtgactta tgttattgga aaaacaggta acatatgaca attctaagga 12660 ggtcaccatt ttttaggaat tagtatcact gataatagca aactcataga gaccaggtct 12720 tgccaagact gcatgtgata taaaatgctt ggtaaaatgt ttggcaagct cctggaatct 12780 gtttatccaa ctgcctattt cactttgggt ttcttttgag caacctacca ctattttcag 12840 aatcaccctt ccagcctact tgtgtcacag ataggaaggt ctttccctgg cattttaggc 12900 tttttatttt ttttttttga gacagagcct cgttctgttg cccgggctgg agtacagtgg 12960 agcaatcttg gctcgctgca acctccgcct cctgggttca agcgattctc atgcctcagc 13020 ctcccgagta gctgggatta caggcatgcg ccaccacgcc caactaattt ttgtattttt 13080 tagtagaaac ggagtttcac catgttggcc 
aggctggtct tgaactcctg acgtcaggtg 13140 atccgcccac cttggcctcc caaagtgctg ggattacagg cgtgagccac cgcgcccggc 13200 cgcattttag gctttaaaag gatacaggaa actagtactg caagcttagc cgtgccatta 13260 ctgaactgat taaaattttt gacacaaatt gtaaatcagg caaaatgcaa cttcaaaagt 13320 tcaaatatta tgaaaaatat ttactgttac tatgaatcta tttcaagacc aatgaaggct 13380 gaagtaagac ctaactgttg ccttgccaaa tccatttctc aatataaatg ccatgtaaaa 13440 ttacacaata gatttactct aacccttttt acatgggagt tacaaaacga agcctttaaa 13500 cccctctagg gataaattta cttttctcag ccaagtgtta acagcaataa atggaggatt 13560 tacatgtaat ccctaaaagc aaataggcag aaattaccaa ccatgctatg actttttaac 13620 taagcatttt atttaatctc acttgcataa cagggcaatt taaagggtgc cagttctaaa 13680 aatatgagat gcaaatgcaa aatagtgtcc ctaatatttt aaagcaaatt ttctaggatt 13740 gtatattcaa acaaatgtgg gttggttttg ggtttttttc agagtactca ctaggaataa 13800 ctaaataaac acctattcaa tcaatactgc catttcttaa aaataataga tacctgagaa 13860 cttttttcaa agtcagtgca taagcaatgt cctttttttt tttttgagat ggagtttcgc 13920 ttttgttgcc caggctggag ggcaatggca cgatcttggc tcaccgcaat ctccgcctcc 13980 tgggttcaag caattctcct gcctcagcct ccagagtagc tggaattaca ggcatgcagc 14040 accacacccg gctaatttta tatttttagt agagacgggg tttctccatg ttgaggctgg 14100 tctcgaactc ctgacctcag gtgatcagcc cacctcggcc tcccaaagtg ctgggattac 14160 aggcgcgagc caccgtgccc agccacgatg tcctttcaaa atagatacaa atgaacatta 14220 cactgaaaaa aatggattta ttcataggct aattacagca taataaatga gctatatggt 14280 aagactcttt ttgtgttaac tgctcattgg tggcccacca tcctagaact gtatttccag 14340 gattccatca gcatctgctt aatgagatct atttctctgc taagtattct ttagataagt 14400 aaatcagact ttatcaaaaa cagaactcat attattgcaa gccataatag ctagtttgtg 14460 atttgtaaga ccatgttatt gatatcacaa attaaaaaga gatgtcccat ttagaaaaag 14520 agatcccttt acccaaccat catactctaa aaataaaacc aacatttctt cacacacaaa 14580 caaaatcaag gtgggaatac acaagacttc aacacaaaaa aatcttatca aagggtcagc 14640 gtttcagatg ttctaaactt gttttcacca gtcagaaata gtctagtctg ccaaatacta 14700 ggccacatgg ctgatgtcac ttaccttctt gtattccaac ctcactgaaa aatcaggagg 14760 aacctgttta 
caaacaattt gtgccttgag cagagttctg atacctctaa ataaacccct 14820 tctaatttct tatcacacaa ctcgtaagag ttcagcaaaa ataagaaaaa aactactagc 14880 caaatataca taaccactct tcacattttg atgtggtaaa aatgttcact tgctaaagct 14940 tcttttcttc aagacattct ccttttttgc cccctcttcc ttccacttac atccttgcta 15000 aacatttcag tattattcag ataaccagta caccctagtc aagaaaacga ctttctgaag 15060 aagctgagca cagcggtatt ctggaataaa atcccactgg gctctgatta gaagtgttaa 15120 cactcctgcc catgtacaac ctgtcgcacc tgccaccagc cccaaaggct gggactacca 15180 cagaacctgt ttccagtgtt cgacgcctcc tgcaaaacat cacttactgt gccgggcata 15240 tgacttctct cattcttccc attctgagag ctgccatttt ttagtttctt tttctttttt 15300 gctgtttctt tgtgctcttt ctgtttgttt tgcacttcaa taagaacaga aataaagctg 15360 tttttcactt ccttttcaaa ctccagttca tctcgtaaag ccaactgctg caccagctcc 15420 tcagagtact ccttaatggc agtctcaatt tcttccagga tttcatttaa ctcagacact 15480 gagagccttt tcactcctgt gaccaaaagc aaaatacagc agatattaag acaaaaacat 15540 ttcaaggaag caaaacagag tagtcactag gaatgactaa ataaacacct atgtaatcaa 15600 tgccaaagag caaaaatcat attccaagca attaaattga ctgtgggtga tcttctcaga 15660 attctacaac ttctggagat cagatctgag tagaggtaag tgggcaggtg agtcacttct 15720 cagcttgctg gcaagaaagt ctaacaaaag tttttacttg tgcaaataaa tcattgcatg 15780 aataatactt cagtcaagac tgtgaaggct tgtacacgga aaagaacaaa cacaaaagtc 15840 gagggaaagt acagaatcca cctgctgaca aaaaactgca agccgttcag cagggttctg 15900 ggatatctct gaacaggaaa ggagttgatc tgaaagaatt aaactaaata tacacttaat 15960 tacataagtc ctgatctgtg aattttgttg aaattggtga tatggtttgt atctgtgtcc 16020 ccacccaaat ctcatacaga aatgtgatgc tggaggtggg gcttggtggg aggtcactgg 16080 atcatgaggg caatttctaa cagtttagta ccatccccct ggtgctgttc tcatgacaga 16140 gttcttatga gacctggttg tttaaaagtg tgtggcactc cccccagccc ccttcctcct 16200 gctccagcca tgtgaagata cctgctccag ctctgcctcc caccatgagt aaaagctccc 16260 tgaggccccc ccagaagcag atgctaccag gcttcctgta tagcctgtgg aaccattagc 16320 caattaaacc tcttttcgta taaattaccc agtttcaggc attttatagc agtgcaagaa 16380 tggactaaaa aaactgggaa gacaaaaggg aacaatattc cctgaaaccc catttcagtc 
16440 ttttctcagc acttctgccc cttgtgacag ataggacact gttaaagtcc caacccacag 16500 ggcctcagaa tgtggagagg gggtctttaa agaggtaaat taaggttaaa tgagggcatc 16560 aggataggcc ctaacccaac atgactggta ttcttataaa atagggtgat caggacacag 16620 taggaagacc acgtgaagac atagggaaaa ggcagccatc tgcatgtcaa ggagagaggc 16680 ctctgaaaaa aaccaaacct gccaacacct tcatctcaga cttctggcct ccagaactat 16740 agaatataaa ttcctgttgt ttaagccacc cactctgtgg tactttgttc tggcagccct 16800 agcaaaccaa tacagatttt agtatctgta tcaggtatca ggactgtatt tgggagtcag 16860 gacgttaaca ctgtttagcc catacctaca ggaacatggc caaatcagac catctgaaag 16920 aatttggtct gtctgaccat ctgtatgttg aagccataca acccaaaggt ctttaggtaa 16980 ctaagctttg gactgggcat gccataaaga taaaactagg tgacaacacg agttgagcag 17040 atacagaccc catcagtgcc tattcaagga agggaccagt catatggcaa tgattagata 17100 cacaccttcc aaccagcctc aggctgttca caggctggtt atatttattc agacattttt 17160 catgaaaatc aactgtagag tttctagagt tcattattct cacatatgtc aaacagatcc 17220 atggttccca aaattcttca tagtaacaca caagactatt tgtacaaaca ggaattgcat 17280 ttgacaaata attactgaat atctattcta cacttcaaat atgatggggt ttcttataga 17340 ctcaattgtg tcctccccaa aattcatgtg ttgaagtcct aaccccaagt acctcagaat 17400 gttacagtat atggtcttta aagaggtaat tacattaaaa taaggtcatt aggatagacc 17460 ctaatccaat atgattgctg tccttgtaag aggaaatttg gggccaggca atggtggctc 17520 acgcctgtaa ctgcagcact ttgggaggct aaggcgggcg gatcatgagg tcaggagttc 17580 gagaccagcc tggccaacat gacgaaaccc cgtctctact aaaaatgtaa aaattagcca 17640 ggcgtggtgg cgggcacctg taatcctagc tacttgggag gctgaggcag aattgcttga 17700 acccgggagg cacaggttgc agtgagctga aatcatgcca cttcattcca gcttgggcca 17760 aaaagtgaaa tcccgtctca acaacaacaa caaaaatgta aatttggaca caaacacaaa 17820 cagaaggagg atgaaatcaa agctttctgg agatgatcat actatacatt tgatacagac 17880 aaaaagcaca gaaatcattg atttcagcag gctcccttac tctcttcata actgccggta 17940 ctagacctct tgagagtttg aatttcctgg gaaagcattg aaagccgatc tgactgtgta 18000 ggggtttcat catcttctgg gtccggtgat tcctgcatca tttcttcaat ttcttcaata 18060 accttcgaaa acacacacac cactgttaat caaatataca 
aaaggaaaat gatctatctg 18120 gaacacagtg ctcacctaag gcacccagag cagaaatgca ctgaattctc cattaaccaa 18180 tggtgttcag gtgtggatgc tccaaataat caaaattttt actgtaggag tgaacatggt 18240 ttatcaagaa aaaattacaa tatactaaac acaaacctaa atagaattgg atttactacg 18300 tccatcatga ttcagaaggc cctaacatgt cttgcaaagt ctaggtaatc gtgctacaaa 18360 cacataataa tttacttcag acacaccaga gctagatcat acagtaatcg ttagaagaaa 18420 gcaaataaca gaaaatttac catagacata caagtgctca atttaaaggt atctcatcaa 18480 aaattctgac ataaaattta taaaatacaa tagcacaaaa aaagtatata taaatgggaa 18540 attacctcta tgacagcaaa atattctttc aacagcagaa aaataatttt atatattaat 18600 tcaacgaaaa aaagggattt tttaaattac tcaaaaagta gaagtccaaa gagaaaaatg 18660 ctgtgaaaag cagaaaaact tcagaaatac atttgacttt aaacaatgat ctactgtcta 18720 gtgcagtgct gctcaaatct tacaacagta tcttgccctt ttccagattc taaaagttga 18780 gaaggatata gaactcagcg aaaatgtaaa ggaaacccag gaaaattact caatagttag 18840 aaaaaaggac ctaaagttaa aaaaaaaaat taaacacaat taaatgaagg agtaagacat 18900 catgagcata tttccatgct ttatgaatta acaagcttaa caggtaagat ttccaccaag 18960 gaaaaacaaa ggcaaatctg aaggagatgg gattatgtta attataagaa gagtctccaa 19020 aacaggaaat ataataaggt acaatgagcc actcaaggat cctacagaat ctgttattct 19080 aactcttaaa aaaagggaga gtttacggta ttaggcaggt cagtgagggg acaatgaatt 19140 ctcaaggtca tgtctatccc tagattaccc tctacctcaa aacaaccttc caacctacta 19200 ttaccttgct tttttaagca ccacattctt cccacgccta gaacaattca accacaaata 19260 tgactaagga ggatccaaat aacacaattt taatgtaaat ttttaatgta ctcctcccaa 19320 gcttgccata atgtatctcc ttactgatct catctttcca gcctaaaaat agagtgtggt 19380 agaactcaca gtccgagtag ccatggaaca ttttatttca ggaagaagtt ctggcatcac 19440 atttcaacaa tacaacagag gtgggaagag tattcagaaa atgagaaagc tgattcgact 19500 ggttttaagt gaatacaggg ttgaccttat tagcgcattc cacgggaata atgctcactt 19560 ctttctgggg cacacaatat gttactttaa tacatgctaa tgctttttac ccagggtttt 19620 gaagatacta aataatttct acacagatac tttacagttt gtccgtgttt gggtaaatct 19680 aaagaaacaa ggaaattcac aatatttttc tggtttgggg ttaagcaaca acctttcctt 19740 ggtaccttgc tataatcttc 
aacaaggcct ctgttggaaa ccttcccaac taacataatc 19800 ccttctgctt cagatcacaa atacttattt ttctctgtgg aagagttaac tattaagcct 19860 gtataaccag aaaaaaatag gaaagataag taaaccaaca gccatcattt actgtcataa 19920 caagaagccg tttttcagct ctagagtctc ctttccttgc gttgctgagg atctcctcta 19980 tacctggtct gccgtgaaga ggggttcatc attaacacag gagacgatga ttgagtgcat 20040 atccagctgt tctctcagct cttcatcatc tgatgtatca aagagcaaac tgtcacttac 20100 ctaaaaacaa aaatacccat cattaaaagc aggacatcct catcaaaaca aagttaacag 20160 ctctgagtaa aagaggctgc agagaaaagc aactactaag agccttccct caagagggag 20220 aaattattac aatcactctg taccaggcaa tgagtatata ttcatcttga acagaaatga 20280 gaagaatgta gaataaatat gaaatcactg agatgctgca taagactgct ataaaagctg 20340 acagatatgc agaaaaaaat acggtattca ttagcaagtc atagggaatt acagtgaata 20400 gaagggagaa tatgggcagg gttagcaaca cacactaaga ttggtagacg ttaacattct 20460 aaagaaaaca taaaagttag ttacttgcaa actaacttct aatatctgag taaaagatga 20520 tcaactccct gaaaggcggg atctgtccat gagcgtcaca cttagccctt ccaattgcac 20580 attccttctt aacaaaaaat aataacccta ggttccaaaa acatctttaa acctaactaa 20640 gctggtgatt caccactcac ctcctcacca tcttccaccc tgctctgcgc tccaggagga 20700 cgacctttat ggaagacatc aaaggatgcc cttgctgtct ggcttccagg tgggttaaac 20760 cccattggca gacactggca gactggaagg tgggatggtg aaggctgggc atgtattttg 20820 ctgtcttcct ccctgtcaga ctgtcatttg ttgactatgg ctcttcacca aagccatagc 20880 tctgattagc agacttctcc cacaaggcca atctcaacag tttatggtaa tttctccctc 20940 tccttacctc ttcatgccca ggactgctaa agtaccagtt tctgttgcca gccctggggg 21000 attgcaccac atcaagtttg tttcccttaa ccctgcccac accaatataa acagccttag 21060 agtttagtac cagtactgat tgtaaaaact ttgcctgaaa aaactcagca gattaaaatg 21120 aacaagggaa catctgtcta tcatctacct atctgtcatg acatgactat tcaggtgttg 21180 aatggaactg aaactagaac tctattcctc aggaaggatt agcacacact gagtttccaa 21240 agaataaatg ggcaacaagg attacgtaag aagccaccat gtcattctat tacatagtgt 21300 agaactaaca tgataaaatt ttacctggca aaattattta tccatgcaga ggatatatgt 21360 ctaaagctga ggacacaatg gaatggatgt aggttcctac tgcttcccta gttcttcata 21420 
acatgagcat ctgaagtttg caggtgaaca cagcctacag agtaagaaac ctgcacgtga 21480 gaagggccaa gggaagaagc actggcacac tgactttcca acattaggct gggagaaggg 21540 ccccatcaac actgcacggt ccacctacaa ccctataatg caactccagg ctcctgcgca 21600 gtgacgacag cccttctcat tttatttctt actgcaccat cagcctagca aaggcctggc 21660 catgaccttc aacacagcat catgaactaa gaacactgtg tggaaaaaag gagaatctaa 21720 aagcttccag ggagggaaaa aaaaatcaca ttaaaaggat caggaatcaa aatgtcaagt 21780 gaattctcaa tagcaacaat agatgctaga aaataatcga tgtaacttcc atgtttggag 21840 aaattccatc ctagaatcct atatccagct aaaacaagta ggcaggggct aggataaaga 21900 cattttcaga catacaagtt ctgaaaaaat ttacctctca tgtgcccttt ctcaaaaagc 21960 taccaggagg aaaatgaaaa agaaaaaaga tatgtggtct aataacaagt gacaaagaga 22020 attctcgtaa tgatagtaaa gtgaaaatcc cagaatgatg actgtaaagc tacactagag 22080 agtaacattc agattagaat ttttattgca gagagcatgg agaagaggga gaggtgggtg 22140 gggagggaga gcagaaggac tagatagatt acctagtgcg gtccatttac taaaatgagt 22200 tctacggctc tattgaggct gaactggtca gctaattata aactaaataa accacaaaaa 22260 ggcaggtagt atcctaagaa aaaacctaag tagaaaaaga gaaaatgtaa acataatata 22320 tcataatggg tcaactgtgc acatttaaat agtaaatata aatgatgaag attaattcaa 22380 ccccagatgt tatgtaatta cattgggaag atggggaaag ggaagttttt gggagggtat 22440 agtgtcaggg attggcaaag gacagaagag agctaaacct tcaccctctg ggtagaaaat 22500 aaataaagtc taaaattgag aaaataatat caagtaaaat ggcatgagca tgttatctgc 22560 taaaagaaat agctgatatc atttaaagtg gccacctctg gtaaggagaa atcaagtgga 22620 gcagagactg ccttttcatt taagtcctat agttctaatt tactttttaa actatgacag 22680 ctgttccttt aataatttaa ttttttaaag aaggagaaag atcaagagac taggctcttg 22740 atgagagggg atgtatgatt tctatactgt agaagatgaa aatgtggcac actgtgatca 22800 caaaggcatg tcgataccat ggagtagacc taccgattag tttcatcgat ccttacactt 22860 cgaacacttc acaccccctt cccaatcact cttcctatgt tgcaacttga ggaccatgat 22920 tttgttttat ttttcccctt ctggcttata aatactaaaa gttttaattt taaaattaat 22980 ccggggtggg cacagtggct catgcctgta atcccagcac tttgggaggc tgaggcgggt 23040 ggattgcttg agctcaggag tttgagaaca gcctgggaaa catggtgaaa 
ccccattgct 23100 acaaaaaatt agccaggcat gatggcacat gcctgtagtc ccagctattc aggaggctga 23160 ggtgggagga tcacttgagc ccaggaggtg gaggttgcag tgagccaaga tggtgccact 23220 gcgctccagc ctggacaaca gagccagacc atgtctcaaa aaaaaaaaaa aaaattactc 23280 ctaaggaagc ccaggttgcc atggctcaca cctgtaatcc caacactttg ggaggccaag 23340 gtgggaggat ctcctgaggt caggacatca agaccaccct gagcaacaca ggaagacctc 23400 gtctctacag aaatatattt tcttaaaatt agccaggcat ggcggtactc gtgtgtagtc 23460 tagctactca ggaagctgga gcgggaagat cgcttaagcc caggagttgg aggctgcagt 23520 aagctatgat ggcgccactg tactccagcc tgggcaacag agggagacac tctctcaatc 23580 tatcaatcaa tcaaattgct tctgtggaag acattatctt tttcttattt ctactcttaa 23640 aggtataaag atgcatacaa aagtccctgt cctgaaagaa atgcacatgg aaggagaaat 23700 acagcacaga agatgttctt aagctgaaat gaatgataag cattctatca tataagtata 23760 ttcaagaatg gcaccagatt tagaataggg ttatggccaa aacacatgag taataacagt 23820 aaactattat aataaatttc catgacaaca gaaatgatcc cagatcatct ttccagagag 23880 agagccaagt ccattcccca cagagatggc atatgctaag agctggaata tctttcaggc 23940 tttgcaaact agccctggct gtcatgggta tcattattca ctcctccctt cagcattaaa 24000 tgtgccttct ctttgccaag cattcccatc actccttcca atgccagggt tagatacaga 24060 cctggtggta ctgtttccct gtataaaact gaagatacat caaatggaaa aaatgtgccc 24120 acgtaagaat ttgaaagata ccagttgcct tactctaaaa aagacatatc aaatttagaa 24180 gtgtgccatc ttaatcttca atcaaagaaa aatttaaaac tgcaatctag caaactagtt 24240 ttcctgaatg tttttctctt tttaatgcaa aaaccattct tcttttgtca aaaaaaattt 24300 ctatacactt ggttttataa gtaaaataat ctaaagaatg ctttcaaagg gtaaatttta 24360 catagcttag tcacggcacg aacagagcac caggtggcaa tggtaaatac agtcactttg 24420 ccagattaaa aaaaaaatgc ttaattatat accatgtaat agaatgacgt gatgacttct 24480 tacatagtcc ttattgccca tttgtttttc agaaactcag tgtgtgctaa tcctccctga 24540 gaaacagagt tctgatttca attctattca tcaccagaat agtcatgtca caacagagag 24600 atgataggtg aatgatcacg tgttcacttt catctgctga gtttttccag gtaaggctgt 24660 tttttttttt attttttgag acagagtctc gctctgtcac ccaggctgga gtgcagtggc 24720 gtgatctcgg cgcactgcaa cctctgcctc 
ccaggttcaa gcaattctcc tgcctcagcc 24780 tcccaagtat ctgggattac aggcatgcgc caccacgcct ggctaatttt gtatttttag 24840 tagagacggg gtttcaccgt gttggtaagg ctggtctcaa actcctgacc ttaggtgatc 24900 catccacctc agcctcccaa agtgctggga ttacaggcat gagccaccgc gcccagccag 24960 gtaaggcttt ttaatgtcag tactagcatt aaactctacg tcattactaa tcactgcctt 25020 ggccatatca ttaccgtggc attgaaaata acattcaaaa ggcagcaaaa gggagcaacc 25080 tcccaagatg caatctaaga acaatacaag acatacagca gtccctcggt atccaaggga 25140 aatttgttcc aggacacccc aatagcaaaa tctgcagatg ctcaagtccc ttatataaag 25200 tggggtagta tttgcatata acctgtgcac atccccctat aagctttaaa ttatatgtaa 25260 attacttata atacctaata caaggtaaat ggtttatagt tatattgtat ataactatac 25320 aatatttata ttataaagtt atattgtatc ggttttcggt tttttatttt cattattttc 25380 aattgttgtg ttgttatttt attgtggggt ttttttttca agttttttaa tccacagttg 25440 gctaaagcag cagatgtgaa accagagaat atggtgggcc cgctgtacat ttagtcaatg 25500 aaattggatt aaactgtctc actcacgctg ctaatgggcc tgactttgaa actatgaatg 25560 gaaactcttc tatgctattt tttaatatgt caaaaaagta gtacccatgg actggttttg 25620 gtgcattctg cattatttta aataacatag aaaatatttt tttaaaacct ttctatcacg 25680 atctctggcc ctacagtttc actgacaagg ctattcactt attcactcaa taaaaaggta 25740 tcttcaaact tactatgtgc caggaacagt atagttacta taaagacaaa tacaaatacc 25800 tgccgggaag tcacaatcta gtgacagacc cagaaacata tgattaacaa gagaaggcag 25860 tggtaatttt agagacaggg acaaagtgat ctggggacta gcacccatga gacagcaagg 25920 ctaaaccccg tggcacagat aacaagtcct ttcaaagttc acaaccccac aagtgccatc 25980 agttccccta gaggggagtg tgttttcctt ccgtgcagga atttaggcca tggctaaatg 26040 accatctgtc agagatgctg gaggtgagga ttccccccaa gtgagaggct ggattaagtg 26100 tcctttaaag ccccttccaa ctctaaaatt gtataattcc atgagatcag caaaggctaa 26160 aacagtaata ggaagcttca gagaggaagg gaaacttgaa ctaagcccaa aatgatagcc 26220 aggacctaag taaacaagac agagcaaaag atccctcatt ttattccttt tcattaaata 26280 tatgccaggt cagtctgcaa agcttaatgg gattaatgac tggcacagtc aggaagattt 26340 ccatagctac tgaatactgg gaaatacagc cagaattcta cattttacta gaaaataata 26400 acagtaggct 
tttctgatca catatgtaag aaaatccttt taacaaaaca ttaacaatac 26460 tatgtaaaat agtgacaggc ctactgcact atattctcaa taatcacata attacactta 26520 atgaaatata gagcacgcaa gaagatctgg catgagttta cattattatt tgataaggaa 26580 cttaaacagc caggccctga aacttagata atttggatat gtttctcatc agatttaaaa 26640 aagcttaaca ctgtgatttt ccatttcaag taattctcaa cccttttctg cctccctacc 26700 tgtcccacca ctcgtgctca cacttcttag tcaagacatc ttcactttac aataccagga 26760 tgtacacagg agaccccaag aaaggtaagt tagccaagac cgtattttaa aagccaagaa 26820 ctaaaatatc ttttaaaggt tctccagtat tcacaggaat acacagtctc ctgacaacat 26880 atcatgtcta gcaggtcact acactcatta aaaagttaac tccgtgttga gattccaata 26940 ctctcagtat attaaaccaa aggctagcat ttttactgag ggtttcagtg caccaaattt 27000 tactgagtgt ttcagtgcat catcactgcc aggtgcagtg gctcacacct gtaatcccag 27060 cactctggga ggtgaattac ctgaggtcag gagttcgaga ccagcctggc caacacagcg 27120 aaaccctgtc tctactgaaa atacaaaaat tagccgggac tggtggctca tgcctgtaat 27180 cccagctact cgggagtttg aggtgggaga atcgcttgaa cccaggaggc agaggttgta 27240 gtaagccact gtactccagc ctgggcaaca gagtgagact ccgtcaaaaa aaaaaaaatt 27300 gtgatggaaa tttagctaca cataaaaaat gagtgtgctg gagaagaact ggaggaatgt 27360 gccagcaagg tgatggccct acagccaaga catcaagaca gagccaccca aactgctata 27420 atttcagaaa tgagaggcag gtcagtgaat ctgtgttaag cagaaatgct actgttttag 27480 ttctattatc agcatcagtt attcaaacac tattggttca attcctaact tacccctttt 27540 tctgagaggt tcagagtaag caagtgcaag gtcctagtat gcgatgactt ccagtctaca 27600 ggcatcacat tcccataatt atctgtcagg gcattccaaa tccttaaaaa gaagaaagct 27660 acaatcaatg aaaattaaca ttctgtatgt ctttcttaaa caaatattca gtgaaagatg 27720 aacagcaaat gtcaaaggaa acagagaaaa acagacagca cattgtttct cagaaaaaca 27780 tcaacaaagt agagtctcct aaacaaaaac gaggtgaaaa atcacataac tgggctgaca 27840 agaagtccta agggaaggtg aagccgtaca gaaaactgcc aataggcagt aggcaacact 27900 tccattatat taggtgctag gcactgttct aagcaattta catgtattaa ctcatgtaat 27960 cttcaaaaac aacattagaa gggggaacca ttattaacct cattttacag atgaagaaac 28020 tgaaacacag ggcggggaag aaatgtttcc agggtcacat gaataataaa tacaactgtg 
28080 cattacactt atttttcttc tgtaacttca aacaaacaga ccatttcaaa aagtgaaaaa 28140 tctctctcaa acaaaaggtt tgttatcctt tctggagaaa gaattcgctc tcaatgtgag 28200 aacaggattt aatatctaat ttgtaataaa gaagctaagc actaaggcag atacaacaaa 28260 aaggagaaag agagggagtt cacaataaaa cacacccagg ctctatccaa tgcagttacc 28320 tgaccttatt acccaacagc attacaggga ggcaagaggg tcaaatggta agggcatttg 28380 actgagacta gggaaacaaa ctctcatctc catccttctt caaataacga gtcattggcc 28440 aggcccctta acctccctag gtatcctctg cccttctata aatgagagac ctggacaaga 28500 agcagatact ccttccagct ccattactct aagcattaat tcaaatgagg tgtattgaag 28560 agagtcccaa accacaccag ataagaacac caaactcaaa tttcacaacc aaaattcagg 28620 tttcaatacc aaagtaataa tcagcacaat aaaacatgat gttttatgaa aaactaaagt 28680 tcagagatac aagtgacttg aaatctaatt gggatacgta aaatcaaaca ctaaacatac 28740 aagctactta ctagctagag gtatacacaa agatgtcatg tagaaaaaaa caccctaaaa 28800 tgtttattag gtatctaatt gtgaataagc aataggcttt ctttccaagt gtgtaagaca 28860 ctaggataca aaaacaatcc tggctggggc ataaacataa gaatgaattt tttttttaag 28920 ttccggggta catgggcagg atgtgcaggt ttgttacaca ggcaaacatg tgccatggtg 28980 atttgctgca cctatcaacc catcacctag gtattaagcc caagaatgaa aattttaaaa 29040 tacagacata acatagttaa caaataggga ttgggcgcta aaaagaagca ataggactgc 29100 aacggctaaa agcataacat cccaaatgat gtttggcttt aaaatgcacg ttagggatgg 29160 gcctggtggc tcatgcctgt aatcccagca ttttgggaag ccaaagtggg aggattgctt 29220 gaagccagga gttcaagacc agcctgggca acaaaaggga gccccacatc tctacaaaaa 29280 aaaaaaaaaa aaaaaaattg gccaggcctg tggcatgtgc cgcagtccca gctacttttg 29340 gagggtaagg caggaggact gcttaagccc aggagctcaa ggctgtagtg agctatgatg 29400 gcaccactgt actccagcct gggagacaga gcaagaccct gtctctaaaa atatataaat 29460 aaaataaaaa ataaaattag ccttaataaa aatgtgaacc caaaaatctc tagagaacta 29520 actcatctac ccagcaaagc tcccatatgc acagtccatt cctattctcc tctctgttgg 29580 aatatctcct ctaatgccag tgaagtagaa ttagaggcaa tgtctgtgga ctactaaatt 29640 tgctgggttt ctgtggacta ctaaacttgc tgggtttctg gctcatacct tcctggataa 29700 gaaattttca aacactgaat gactgtatag tccattttca 
tgcagctgat aaagacatac 29760 ccaagactgg gacgaaaaag aggtttaatt ggacttacag ttccacatgg ctgaggaggc 29820 ctcagaatca tggtgggaga caaaaggcac ttcttacatg gtggcagcaa gagaaaagga 29880 ggaagatgca aaagcagaaa cccctgataa actcatggac agcggccctc ttctcacaac 29940 cccactaggc agtgccccag tagggactct gtgtgggggc tccaacccca catttccctt 30000 ccacactgcc ctagcagagg ttctccatga gggccctgcc cctgcaacaa acttttgcct 30060 gggcatccag gcatttccat acatcttctg aatctagaag gaggttccca aacctcaatt 30120 cttgacttct gtgcacctgt gggctcaaca ccacatggaa gctgccaagg cttggggctt 30180 gcaccctctg aagccacagc ctgagctgta ctctcgcccc tttcagccac agctggagca 30240 gctggtacac agggcaccaa gtccctaggc cgcatacagc aggcagaccc taggcccagc 30300 tcacgaaacc acttgttcct ttttgggcct ccaggcctgt gatgggagga gctgccatga 30360 aggtctctga catggcctgg agacattttc cccatggtcg tggggattaa cattagggtc 30420 cttgctactt acgcaaattt ctgcagctag cttgaatttc tccccagaaa atgggttttt 30480 tttttttcta tcgcatagtc aggctgcaaa ttttccaaac ttttatgctc tgcttccctt 30540 ataaaactga atacgtttaa tagcacccaa gttacctcct gaatgcttta ctgcttagaa 30600 attttttctg ccagataccc taactcatct ccctcaagtt caaagttcca caaatctcta 30660 gggcaggggc aaaatgccac cagttctctt tgccaaaaca taacaagggt catctttgct 30720 ccagttccca acaacttcct catctccatc tgagaccacc tcagcctgga ttttattgtc 30780 catattgctg tcagcatttt gggaaagcca ttcaacaagt ctctaggaag ttccaaactt 30840 tcccacattt tcctgtcttc tgagcccttc aaactattcc aatctctacc tgttccccag 30900 ttccaaagtc atttccacat tttcaggtat cttttcagca acgccccact ctactggtac 30960 taacttacta tattaggtta ttttcatgca gctgataaag acatacccga gactgggaag 31020 aaaacgaggt ttaattggac ttacagttcc acatggctgg ggaggcctca gaatcatggc 31080 aggaggcaaa aggcacttct tacatggcgg cagcaaggga aaatgaggaa ggagcaaaag 31140 cggaaacccc tgataagccc atcagatctt gtaagactta ttcactatca tgagaatagc 31200 ataggaaaga ccagccccca tgattcaatt acctccccct ggatccctcc cacaacacgt 31260 gggaattctg ggagatacaa ttcaagttga gatttgggtg gggacacagc caaaccgtat 31320 cactgactat agacctaatc ctggcaggaa tttaagcagg atacactagc aggtgaagac 31380 agtccagaat ttgaggagtt 
tgctttgtct gctgatttgc tccttcttaa acaaggctat 31440 aagaaaaagg caaatgggat ggaatatttc acttagaagt gcctgtcagt gagtttgatc 31500 ttccaagaaa atgaaagcca actaacagtc tccaccaagg tcaaaacaat ggtagttgga 31560 aggacacaga ggcaactcta gtgaaagatt tctagaattc tgcagtcaag acatatgaac 31620 tgccttcatc tttatcacca agtattacaa ccagtcgtaa acaacggtca ggacagctct 31680 tttgcaagga ggtcaaagtc ctgccaacat cgaatcctgc agaatactct tataagtaaa 31740 acctcagtac gtaaatacaa tccatattta accaaaacaa aagctgaggc tcagagaagt 31800 taaatgaagt gccaagggtc ggggagggct ttggggtgat tcaagcacat tacatttacg 31860 gtgtacttta tttctattat tacactgtca tatatcatga aataatttta caactcacca 31920 taacgtagaa tcaatgggag ccctgagctt gttttcctgc aactagatgg tcctatctgg 31980 gggagagggg aaacagtgac agatcatcaa gcattagatt ctcataaggt gtctgcaaac 32040 tagaaccctc acttgtgcag ttcacaatag ggttcacgct cctatgagaa tctaatgccg 32100 tggctgatct gacaggaggt ggctcaggtg gtaatgcaag caatggggag gagctgtaaa 32160 tacagatgag gctttgctca ctcactcagc cactcacctc ctgtagtgtg gcccagttcc 32220 agtaccggtc tgtggaccag gggttgggga ccccttgacc taaagcacag caatatctga 32280 gtaaaacaat tatcaattat atctagtgaa aagggaactg gaacagacta tcaagcaatg 32340 atgagtatgt ttttgcaata taaaagggac aatgccagtg atgaccagct cccaactccc 32400 ggaaaaaaag agatttcata tgcattccca agtcaagaga catgcattac caccaacata 32460 tcatctacta aataccatta acaagagcga caaaatattt ttaagcaaag gctaaaataa 32520 cctactctcc tgacggtcta agtgaaacaa aaactcttaa cgatctacgt acaatcctct 32580 aattgattta agtcagctac tcttgaaaaa cccaaataca taatacatct acactaattt 32640 ttaaatccag aatgtgtcct gggtttagtg attcagagta tctaaatcta catatacata 32700 aaattatctt ctctttggga acaggcttcc aaaaatgtca tcttttgcta ctgacggcat 32760 gcatgtgtgt ataaaatcca attataagtg tttcccttct ctacacctgg gcagaacact 32820 gtactgagga accaataaaa ggagatggtc tccaaacaca ggtgggcctg caccactgct 32880 cagccatgtt tacaggcccc actcccaagg ctgtctttac aagcttccta ccttattcct 32940 gataaatctg tttgtttcac aaaggaaagg agacaatcag gctggaaact cctttccgtg 33000 cccgtgactc ctgtcctgtc tctcaaaaat tcttccctaa cctctaagag tctctccatc 33060 
ccatcaatgc tctgaaaatc cattccttcc catgtcctca tttttgagac aaagtctcgc 33120 tctgacaccc aggctggagt gcagttgtgc gatgatggct cactgcaacc ttcgcctcca 33180 ggactcaagc aatcctgcca tctcagcctc cgggcagctg ggtccgtggg accacggggc 33240 atgcgccacc ggacttcact aatttttttt taattttttt gtaaagacca gatttcacga 33300 tattgcccag tctggtctcg aactgctgga agcaagccat cctcccacca cagcctctca 33360 aagtgctggg atcacaggtg tgagccaccg cgcccaatct tattccactc attttacctt 33420 cagcctctcc ggaggctcct ctccccctat atacacaaca tgctcaagtc cctttactct 33480 caccattcct atttaccatc ttttgggggt aaaaccatgg tacctaatta caactcttac 33540 ttaaccaaat tataattgcg gctcaggtgg tttaaaaagt gccctcagga actctgcgcc 33600 tcacctctcc cagctgtgat aaaaccccgt gaacctccta tcattgctct taggtgattc 33660 tgtaattatt aatattactt gtgtatgcct tggtctcccc accagaccat gcgatcgtta 33720 ccatcagtaa gcagggccct tcgttttcgt ctctgtggcg gctcgcactc taccagggct 33780 cccgggtggg tcccacagat tcctgtgccc taaatgctgc ctcgcacgtt ggtacttatg 33840 cacaggcaat atactccacc tagaacgtta gacttttttc cacctgccag gcacactccc 33900 actcatcagt cagatctcgt ttgcacattt ctttataaag tcgttcctta ctctccccaa 33960 gacacacgct tttaagtctc tcacatttgg tacatataac cattgcagcg ctaattacat 34020 ccatacatgt atttgtttac aaatctatct cccatcctac gttgtgtcct aagtgggagg 34080 gacagtgccc agtcgttttg gtgttccctg tgcgaagcag cgcctggcac tcagcaggtg 34140 cttaataaat gcgacttgag tgaatgaatg gcgacacacg tgccccagga ggtgccccag 34200 gaggtgccgt ctctaactat gggctctgca tgggcgattc ccttctaata caaacgaggg 34260 tttggtttgc aggaaggagg gcgggaggta ccgaggcgga ggttttcagg gctcgtggcg 34320 gcggggggca ggggatttaa attgctgggc gagccgaggc cgacggccgg aggaaggagg 34380 cgagagggcg gggactcccg cgtccgggcg gaaggagggc gcagggagga gctgcggccg 34440 tagggctgcc ggtcgggcga ggggtagggg aggctcccgc ccactcccgc cggggccccg 34500 cactcactcg tccccctgca ggaggctgcg ctccgtgatg ggccgcacgg ccgtcctcgg 34560 gggctcggcg cccggatccg aggggcggaa gcacaggctc agcttctcct ccaagctgca 34620 ggccggggcc gggaaaccgt cggcgccccc acccgcctcg gcccccgcct ccgccccagg 34680 ctcggggctc gcgttacagt tctcctggtc caggaggctc cgggccggct 
cctggaactc 34740 atagaaatcc tgccagtccc cgtccgccgc catcgccgcc cggagcagtc gcgcgccccg 34800 cccaggccgg cccagcccgc ccagcccggc cccgccccgg cccgaagccc cgcccccgcc 34860 cgctaccgtg ctgacagatc ccaggctgtc ccaaccctgg gtctccggtg gcctgagtct 34920 tcctggagtt gaacgcggtg ctactagaaa gccaagacct taactgcctt gagtcacttc 34980 ctatattctc tatcatttag gaataacttc cctgttagct tcccttatta gcaacttggc 35040 ctaaatgtgg ctaaaataaa ggacttcttt aaacttacga cctaatatca gattacgccc 35100 cgaaagtcgt tttccccaag gtaacctgct cctttcctaa cttgccaacc agaataacag 35160 ccagagtcaa tgggaaatgt ttaggtggta ccgaagttca ttcctcaaat attttacctc 35220 tcacgtgcaa agcactggct agatcctgca gaacatacaa agataattat gccagttaac 35280 atttgcaaag cacttagagc aatgcctaat agtagatacc ctatgtgtat gttataacta 35340 atacatgggt gccccataca cagggcttac aatgtcggag tgataagagt aacaagaaat 35400 tgtttagacc tcataaaaac taagtgtgat tgggcggttc agttttgcaa gatgaaaaga 35460 gttatggaga tggatggtgg tgatggttcc acgacagtgt taatgtgctg aatgccactg 35520 atctatacac ttacagatgg ttaagatagt aattttatgt ttattttact acaatacaaa 35580 aaaattggag taggccaggc gaggtggctc acatctataa tcccagcact ttggcaggcc 35640 caggtggacg gatcgcttga ggtcaggagt tcaagaccaa cctggccaac atggtaaaac 35700 cccgtctcta ctaaaaatac aaaatctagc tgtcaggcct ctgagcccaa gccaagccat 35760 cgcatcccgt gacttgcaca tatacaccca gatggcctga agtaactgaa gatccacaaa 35820 agaagtaaaa atagccttaa ggatgacatt ccaccattgt gatttgttcc tgccccaccc 35880 taactgatca atgtactttg taatctcccc cacccttaag aagtttcttt gtaattctcc 35940 ccacccttga gaatgtactt tgtgagatcc acccctgccc gcaaaacatt gctcttaact 36000 tcaccgcctg acccaaaacc tgtaagaact aatgataatc catctccctt cgctgactct 36060 cttttcggac tcagcccgcc tgcatccagg tgaaataaac agccgtgttg ctcacacaaa 36120 gcctgtttgg tggtctcttc acacggacgc acatgaaatt tggtgccgtg actcggatcg 36180 ggggacctcc cttgggagat caatccccct gtcctcctgc tctttgctcc atgagaaaga 36240 tccacctacg acctcaggtc ctcagaccga ccagcccaaa aaacatctca ccaatttcaa 36300 atc 36303 

1. An isolated nucleic acid which encodes a polypeptide comprising the amino acid sequence PGECCPLP (SEQ ID NO: 1).
 2. The isolated nucleic acid of claim 1, which comprises a sequence of nucleotides selected from the group consisting of SEQ ID NO: 5, SEQ ID NO: 6 and SEQ ID NO: 7.
 3. The isolated nucleic acid of claim 1, which comprises a nucleotide sequence as set forth in FIG. 3 (SEQ ID NOS: 20-24).
 4. The isolated nucleic acid of claim 1 as deposited in pcDNA3-hCRIM1myc under accession number NM00/16530 at AGAL on Nov. 9, 2000.
 5. A nucleic acid homolog of the isolated nucleic acid of claim 2, wherein said homolog is capable of hybridizing to said isolated nucleic acid under at least medium stringency wash conditions.
 6. An expression construct comprising the isolated nucleic acid of claim 1.
 7. A host cell comprising the expression construct of claim 6.
 8. A pharmaceutical composition for gene therapy comprising the expression construct of claim 6 and a pharmaceutically acceptable carrier, diluent or excipient.
 9. An isolated polypeptide encoded by the isolated nucleic acid of claim 1.
 10. An isolated polypeptide comprising the amino acid sequence PGECCPLP (SEQ ID NO: 1).
 11. An isolated polypeptide according to claim 10, further characterized by the presence of six cysteine-rich domains, an RGD domain, an IGFBP-like domain and a transmembrane domain.
 12. The isolated polypeptide of claim 10, wherein the polypeptide comprises an amino acid selected from the group consisting of the sequences set forth in SEQ ID NO: 2, SEQ ID NO: 3 and SEQ ID NO: 4.
 13. A biologically-active fragment, variant or derivative of the polypeptide of claim 12.
 14. The biologically-active fragment of claim 13, which consists essentially of an amino acid sequence selected from the group consisting of: (i) amino acids 1-901 of SEQ ID NO: 2; (ii) SEQ ID NO: 8; and (iii) SEQ ID NO: 9.
 15. An antibody capable of binding the isolated polypeptide of claim 9.
 16. The antibody of claim 15, which antibody is capable of binding a peptide having an amino acid sequence selected from the group consisting of: (i) SEQ ID NO: 8; and (ii) SEQ ID NO: 9.
 17. A pharmaceutical composition comprising the isolated polypeptide, biologically active fragment, variant or derivative thereof according to claim 9 and a pharmaceutically acceptable carrier, diluent or excipient.
 18. A mimetic which antagonizes or mimics a biological activity of a CRIM1 polypeptide according to claim 9.
 19. A method of modulating the biological activity of a polypeptide of the BMP family, said method including the step of administering to an animal a pharmaceutical composition according to claim 8.
 20. The method of claim 19, wherein the biological activity of the polypeptide of the BMP family is increased.
 21. The method of claim 19, wherein the biological activity of the polypeptide of the BMP family is decreased.
 22. The method of claim 21, wherein the pharmaceutical composition is administered to the animal to prophylactically or therapeutically treat an eye disease.
 23. The method of claim 22, wherein the eye disease is cataract formation.
 24. A method of modulating the biological activity of a polypeptide of the BMP family, said method including the step of administering to an animal a pharmaceutical composition according to claim 17.
 25. The method of claim 24, wherein the biological activity of the polypeptide of the BMP family is increased.
 26. The method of claim 24, wherein the biological activity of the polypeptide of the BMP family is decreased.
 27. The method of claim 26, wherein the pharmaceutical composition is administered to the animal to prophylactically or therapeutically treat an eye disease.
 28. The method of claim 27, wherein the eye disease is cataract formation.
 29. A method of determining whether an animal is predisposed to a genetically-heritable disease, which method includes the steps of:— (i) obtaining a nucleic acid sample from said animal; and (ii) determining whether said nucleic acid sample includes a mutation or polymorphism in an isolated nucleic acid according to claim 1, said mutation or polymorphism being indicative of said animal being predisposed to, or suffering from, said genetically-heritable disease.
 30. The method of claim 19, wherein the animal is a mammal.
 31. The method of claim 30, wherein the mammal is a human.
 32. The method of claim 29, wherein the animal is a mammal.
 33. The method of claim 32, wherein the mammal is a human.
 34. An isolated polypeptide encoded by the nucleic acid homologue of claim 5.